PCX 

INTERNATIONAL APPLICATION PUBLISHED UNI 



WORLD INTELLECTUAL N 
Inlemation 




WO 9602644JU. 



(51) Internatioiial Patent ClassifiGatioo ^ : 




(11 




C12N 15^2, C07K 14/705, 16/18. A61K 
38/17 


Al 


(43) intemati nal l>ublicati n Date: 


1 Febniaiy 1996 (01.02.96) 



(21) International Application Number: PCT/US9S/06493 

(22) Intemational Filing Date: 6 July 1995 (06.(r7.95) 



(30) Priority Data: 
08/277,803 
08/384 J 10 



20 July 1994(20.07.94) US 
1 Ffebniaiy 1995 (01.02,95) US 



(71) Applicant: GENEHCS INSTITUTC. INC. [USAJS]; 87 Cam- 

bridgeParic Drive, Cambridge, MA 02140 (US). 

(72) Inventors: WOOD, Ctive; 2 Hawthorne Place #17R, Boston, 

MA 02 1 1 4 (US), CARUSO, AnAony; 29 Newbuiy Avenue. 
Mediuen, MA 01844 (US). 

(74) Agent: BROWN. Scott, A.; Genetics Institute. Inc.. 87 
CambridgeParic Drive, Cambridge, MA 02140 (US). 



(81) Dedgnated States: AU. CA, JP, MX. European patent (AT, 
BE. CH, DE. DK, ES, FR, GB. GR, IE. IT, LU. MC. NL, 
PT, SE). 



Published 

With intemational search report. 
Before the expiration of the time limit for amending the 
claims and to be repMished in the event of the receipt of 
amendments. 



(54) Title: MUC RECEPTOR TYROSINE KINASES 
(57) Abstract 

Polynucleotides encoding novel leoeptor tyrosine kinases designated "mlk" are disclosed, mlk proteins and ntettods for their 
production, ligands for tiie mlk receptor and methods for their identification, and inhibitors of binding of mik and its ligands and methods 
for their identification are also disclosed. 



FOR THE PURPOSES OF tNFORMATIOS ONLY 



Codes used to Identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Austria 


AU 


Antnlii 


BB 

BE 


Bjrindoc 
Belgium 


BF 


BnldDft Fmo 


BG 


Ba|g«ri« 


BJ 


Bcoln 


BR 


BnucO 


BY 


Bel«nB 


CA 


Ctnadi 


CF 


Ceatmt Ajbkn Rcpoblic 


CG 


Oxigo 


CH 


Swlttertisd 


a 


Cdted'Ivoite 


CM 


Cmmoofl 


CN 


Qnu 


CS 


CncboiloviiUa 


CZ 


CndiRepubUe 


DE 


Qenuoy 


DK 
BS 


DeaatA 


n 


Finlisd 


FR 




GA 


Gtbon 



GB 


Uaitsd KjqgdoiB 


GE 


Geoigii 


CN 


GutDM 


GR 


Greece 


HU 


Hungtfy 


IE 


IrelBid 


IT 


fitly 


JP 




KB 


Kenya 


KG 




KP 


nitniciiiMiic Baoite's Republic 




of Koret 


KR 


RepiAdicof Kocet 


KZ 


Kudtettn 


U 




LK 


SriLodn 


tu 


Ijnemboury 


LV 


Urvn 




MOIUKXI 


MD 


Ri^iMkof MoUwk 


MO 




ML 


MtU 


MN 


MangolU 



MR 


MfluritB&ift 


MW 


Malswi 


NE 




NL 


fifrrtimUii rl. 

ncuiciiiuui 


NO 


Nomy 


NZ 


New Zealand 


PL 


Poind 


PT 


nxtcigal 


RO 


Romania 


RU 


Rutaian Pedentkn 


SD 


Sudan 


SB 


Sweden 


sr 


Slovenia 


SK 


Slovakia 


SN 


Sewgal 


TO 


Ottd 


TG 


TDgO 


TJ 




TT 


IVioldad and Tobaso 


UA 


Ukraine 


US 


United SiMei of America 


uz 


Uzbefcittatt 


VN 


Viet Nam 



wo 96/02644 



PCT/US9S/08493 



MLK RECEPTOR TYROSINE KINASES 

This application is a continuation-in-part of application Ser. No. 08/277,803, filed 

July 20. 1994. 

Field of the Invention 

The present invention relates to receptor tyrosine kinases, nucleic acids encoding 
such proteins, ligands thereto and methods of identifying inhibitors of activity of such proteins. 

Backgroun d of the Invep tmn 

The receptors for ahnost all of the growth factors of biological interest belong to 
the tyrosine kinase or hematopoietin receptor families. Receptor tyrosine kinases (RTKs) include 
the receptors for M-CSF, kit ligand, neurotrophins and fibroblast growth factors. RTKs 
typically act as the first link in the chain of a responder cell which leads to biological effects of 
various growth factors. Ligand binding to the extracellular region of the RTK leads to activation 
of its intracellular kinase domain and autophosphorylation of tyrosine residues. This 
phosphorylation causes a cascade of other cellular events which ultimately result in exhibition 
of the activity caused by the ligand. See generally. Yarden el al., Ann. Rev. Biochem. 1988. 
57, 443; Bolen et al.. Advances in Cancer Res., 1991. 57, 103; Fantl et al.. Annu. Rev. 
Biochem.. 1993, 62, 453; Schlesinger et al.. Neuron. 1992. 9. 383. 

Because of their crucial position in the chain of events leading to the manifestation 
of biological activity. RTKs have recently been the subject of extensive study. Several RTKs 
have already been identified and. in many instances, cloned. Identified RTKs include: fill 
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(deVries et al.. Science, 1992, 255, 989); nt4 (Pajusola et al.. Cancer Res., 1992, 52, 5738); 
flkl and flk2 (U.S. Patent Nos. 5.270,458, 5,185,438; WO93/10136; W092/17486; 
WO93/00349; Matthews et al., Cell, 1991, 65, 1143; Rosnet et al.. Oncogene, 1991, 6, 1641); 
RYK (W093/23429); KDR (W092/14748); tie (W093/14124); Torpedo califomica RTK 
(Jennings et al., Proc. Natl. Acad. Sci. USA, 1993. 96, 2895); tyrolO (Lai et al.. Oncogene, 
1994, 9, 877); ptk3 (Sanchez et al., Proc. Natl. Acad. Sci. USA, 1991, 91, 1819); trkb 
(Middlemas et al., Mol. Cell Biol.. 1991, 11, 143); and HEK (Wicks et al., Proc. Natl. Acad. 
Sci. USA, 1992, 89, 1611). 

Some RTKs are differentially expressed in different tissues. For example, flk2 
has been reported to be expressed in primitive hematopoietic cells, but not in mature 
hematopoietic cells. Thus, the differential expression of certain RTKs in a given tissue type may 
indicate a function for that RTK which is specific to such tissue. As a result, identification of 
tissue-specific RTKs may lead to the identification of RTKs involved in tissue-specific processes, 
and subsequently the ligands responsible for activating these tissue-specific processes. 
Therefore, it would be desirable to identify other tissue-specific RTKs to determine their 
relationship to tissue-specific functions. 

Summary of the Invention 

In accordance with the present invention, polynucleotides encoding novel RTKs 
are disclosed. The novel RTKs have been designated "muscle-localized kinase" or "mlk". 
"mlk" is used throughout the present specification to refer to both receptor proteins and 
polynucleotides encoding those proteins and to refer to proteins and polynucleotides from all 

2 
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mammalian species. 

In certain embodiments, the present invention provides for an isolated 
polynucleotide comprising a nucleotide sequence selected from the group consisting of: (a) the 
nucleotide sequence of SEQ ID NO:l from nucleotide 13 to nucleotide 1602; (b) the nucleotide 
sequence of SEQ ID NO: 18 from nucleotide 1 to nucleotide 2580; (c) the nucleotide sequence 
of SEQ ID NO:20 from nucleotide 1 to nucleotide 2604; (d) a nucleotide sequence capable of 
hybridizing to a nucleic acid sequence specified in (a)-(c); (e) a nucleotide sequence varying 
from the sequence of a nucleotide sequence specified in (a)-(c) as a result of degeneracy of the 
genetic code; and (0 an allelic variant of a nucleotide sequence specified in (a)-(c). In particular 
embodiments, the nucleotide sequence encodes for protein having mlk receptor activity and/or 
mlk ligand binding activity. Polynucleotides encoding various maminalian mlk proteins 
(includmg murine and human mlk proteins) are included in the present invention. In other 
embodiments, the nucleotide sequence is operably linked to an expression control sequence. 

The present invention also encompasses host cells transformed with the 
polynucleotide of the invention, including manunalian cells. 

In other embodiments, the invention provides for a process for producing a mlk 
protein, said process comprising; (a) growing a culnire of host cells transformed with a 
polynucleotide of the invention in a suitable culture medium; and (b) purifying the mlk protein 
from the culture, mlk protein produced by such methods is also provided by the present 
invention. 

Further embodiments of the present invention provide isolated mlk protein 
comprising an amino acid sequence selected from the group consisting of: (a) the amino acid 

3 
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sequence of SEQ ID N0;2; (b) the amino acid sequence of SEQ ID N0;2 from amino acids 1 
to 156; (c) the amino acid sequence of SEQ ID N0:2 from amino acids 157 to 177; (d) the 
amino acid sequence of SEQ ID N0:2 from amino acids 178 to 530; (e) the amino acid sequence 
of SEQ ID N0:2 from amino acids 242 to 517; (f) the amino acid sequence of SEQ ID NO: 15; 
(g) the amino acid sequence of SEQ ID NO: 15 from amino acids 22 to 478; (h) the amino acid 
sequence of SEQ ID NO: 19; (i) the ammo acid sequence of SEQ ID NO: 19 from amino acids 
22 to 486; (j) the amino acid sequence of SEQ ID NO: 19 from amino acids 487 to 507; (k) the 
amino acid sequence of SEQ ID NO: 19 from amino acids 508 to 860; (1) the amino acid 
sequence of SEQ ID NO: 19 from amino acids 572 to 847; (m) the amino acid sequence of SEQ 
ID N0:21; (n) the amino acid sequence of SEQ ID NO;21 from amino acids 22 to 494; (o) the 
amino acid sequence of SEQ ID N0:21 from amino acids 495 to 515; (p) the amino acid 
sequence of SEQ ID N0:21 from amino acids 5L6 to 868; (q) the amino acid sequence of SEQ 
ID N0:21 from amino acids 580 to 855; (r) fragments of (a)-(q) having mlk receptor activity; 
and (s) fragments of (a)-(q) having mlk ligand binding activity. Pharmaceutical compositions 
comprising mlk protein and a pharmaceuticatly acceptable carrier are also contemplated by the 
present invention, as are compositions comprising antibodies (polyclonal and monoclonal) which 
specifically react with mlk protein. Polynucleotides encoding each of these proteins are also 
provided by the present invention. 

The present invention also encompasses methods of identifying a mlk receptor 
ligand, said method comprising: (a) providing a sample containing a potential source of mlk 
ligand; (b) contacting said sample with a protein having mlk receptor activity or mlk ligand 
binding activity; and (c) collecting materials binding to the protein, mlk receptor ligands 

4 
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These corresponding polynucleotide fragments are also part of the present invention. Modified 

polynucleotides as described above may be made by standard molecular biology techniques. 

including site-directed mutagenesis methods which are known in the art or by the polymerase 

chain reaction using appropriate oligonucleotide pruners. 

For the purposes of the present invention, a protein has "mlk receptor activit\'" 

when binding of a ligand to such protein results in tyrosine phosphorylation as measured by the 

tyrosine phosphorylation assay described below or other suitable assays. A protein has "mlk 

ligand binding activity" when it binds mlk ligand as measured by the ligand binding assay 

described below or other suitable assays. 

mlk protein or fragments thereof having mlk ligand binding activity may be fused 

to carrier molecules such as inmiunoglobulins. For example, soluble forms of the mlk protein 
such as those fragments comprising amino acids M25 of SEQ ID NO:2 may be fused through 

"linker" sequences to the Fc portion of an immunoglobulin. 

The invention also encompasses allelic variations of the nucleotide sequence as 
set forth in SEQ ID NO:l, that is. naturally-occurring alternative forms of the isolated 
polynucleotide of SEQ ID NO: 1 which also encode proteins having mlk receptor activity or mlk 
ligand binding activity. Also included in the invention are isolated polynucleotides which 
hybridize to the nucleotide sequence set forth in SEQ ID NO: 1 under highly stringent (0.2xSSC 
at 65X), stringent (e.g. 4xSSC at 65*C or 50% formamide and 4xSSC at 42''C), or relaxed 
(4xSSC at 50°C or 30-40% formamide at 42°C) conditions. Isolated polynucleotides which 
encode mlk protein but which differ from the nucleotide sequence set forth in SEQ ID NO: 1 by 
vinue of the degeneracy of the genetic code are also encompassed by the present invention. 

7 
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Variations in the nucleotide sequence as set fortii in SEQ ID NO:l which are caused by point 
mutations or by induced modifications which enhance mlk receptor activity or mlk ligand binding 
activity, half-life or production level are also included in the invention. 

The isolated polynucleotides of the invention may be operably linked to an 
expression control sequence such as the pMT2 or pED expression vectors disclosed in Kauftnan 
et al.. Nucleic Acids Res. 12, 4485-4490 (1991), in order to produce the mlk protein 
recombinantly. Many suitable expression control sequences are known in the an. General 
methods of expressing recombinant proteins are also known and are exemplified in R. Kaufman. 
Methods in Enzymology 185. 537-566 (1990). As defined herein "operably linked" means 
enzymatically or chemically ligated to form a covalent bond between the isolated polynucleotide 
of the invention and the expression control sequence, in such a way that the mlk protein is 
expressed by a host cell which has been transformed (transfected) with the ligated 
polynucleotide/expression control sequence. 

A number of types of cells may act as suitable host cells for expression of the mlk 
protein. Any cell type capable of expressing functional mlk protein may be used. Suitable 
mammalian host cells include, for example, monkey COS cells, Chinese Hamster Ovary (CHO) 
cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 cells, 
CV-1 cells* other transformed primate cell lines, normal diploid cells, cell strains deri<^ed from 
in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, HL-60, 
U937, HaK, Rat2, BaF3, 32D, FDCP-L PC12 or C2C12 cells. 

The mlk protein may also be produced by operably linking the isolated 
polynucleotide of the invention to suitable control sequences in one or more insect expression 
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vectors^ and employing an insect expression system. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, e.g.. 
Invitrogen, San Diego, California, U.S.A. (the MaxBac® kit), and such methods are well known 
in the art, as described in Summers and Smith, Texas Apricuitural Experiment Station Bulletin 
No. 1555 (1987). incorporated herein by reference. Soluble forms of the mik protein may also 
be produced in insect cells using appropriate isolated polynucleotides as described above. 

Alternatively, the mlk protein may be produced in lower eukaryotes such as yeast 
or in prokaryotes such as bacteria. Suitable yeast strains include Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any yeast strain capable of 
expressing heterologous proteins. Suitable bacterial strains include Escherichia coli, Bacillus 
subtilis. Salmonella typhimurium, or any bacterial strain capable of expressing heterologous 
proteins. 

The mlk protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows» goats, pigs, or sheep which are 
characterized by somatic or germ cells containing a polynucleotide sequence encoding the mlk 
protein. 

The mlk protein of the invention may be prepared by growing a culture 
transformed host cells under culture conditions necessary to express the desired protein. The 
resulting expressed protein may then be purified from the culture medium or cell extracts. 
Soluble forms of the mlk protein of the invention can be purified from conditioned media. 
Membrane-bound forms of mlk protein of the invention can be purified by preparing a total 
membrane fraction from the expressing cell and extracting the membranes with a non-ionic 

9 
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detergent such as Triton X-100. 

The mlk protein can be purified using methods known to those skilled in the an. 
For example, the mlk protein of the invention can be concentrated using a commercially 
available protein concentration filter, for example, an Amicon or Millipore Peliicon 
ultrafiltration unit. Following the concentration step, the concentrate can be applied to a 
purification matrix such as a gel filtration medium. Alternatively, an anion exchange resin can 
be employed, for example, a matrix or substrate having pendant diethylaminoethyl (DEAE) 
groups. The matrices can be acrylamide, agarose, dextran, cellulose or other types conmionly 
employed in protein purification. Alternatively, a cation exchange step can be employed. 
Suitable cation exchangers include various insoluble matrices comprising sulfopropyl or 
carboxymethyl groups. Sulfopropyl groups arc preferred (e.g., S-Sepharose® columns). The 
purification of the mlk protein from culture supernatant may also include one or more column 
steps over such affinity resins as concanavalin A-agarose, heparin-toyopearl® or Cibacrom blue 
3GA Sepharose®; or by hydrophobic interaction chromatography using such resins as phenyl 
ether, butyl ether, or propyl ether; or by immunoaffinity chromatography. Finally, one or more 
reverse-phase high performance liquid chromatography (RP-HPLC) steps employing hydrophobic 
RP-HPLC media, e.g., silica gel having pendant methyl or other aliphatic groups, can be 
employed to further purify the mlk protein. Some or all of the foregoing purification steps, in 
various combinations or with other known methods, can also be employed to provide a 
substantially purified isolated recombinant protein. 

Preferably, the isolated mlk protein is purified so that it is substantially free of 
other mammalian proteins. 

10 
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The mlk proteins of the present invention may also be used to identify and isolate 
mlk binding ligands. For example, mlk proteins having mlk ligand binding activity may be 
anchored to a solid support (such as in a microtiter plate or a binding affinity column) and 
exposed to possible sources of mlk ligands. Species binding to the mlk protein can then be 
eluted from the mlk protein and collected for further characterization. Sources of mlk binding 
ligands include among others blood, cerebrospinal fluid, extracted neuronal, muscular and 
splenic tissues, and membranes and conditioned media from cell lines derived from neuronal, 
muscular and hematopoietic tissue, mlk binding ligands may also be used in phamaceutical 
compositions as described above with, respect to the mlk pix)tein. Ligands may also be identified 
in this manner which bind to the inuracellular region of the mlk protein. 

The invention also includes ligands which bind to the mlk receptor ("mlk receptor 
ligands"). The ligands may be growth fiactors that occur naturally in a manmial (eitfier the same 
mammal or a different manunal from which the mlk receptor to which the ligand binds was 
isolated). The ligand may be isolated and purified, or be present on the surface of ligand- 
expressing cell populations. The ligand may also be a molecule that does not occur nanirally 
in a manunal. The ligand may also be a non-protein molecule that acts as a ligand when it binds 
to or otherwise come into contact with, a mlk receptor protein. In cases where a ligand is a 
protein, polynucleotides encoding such ligands are also within the scope of the present invention. 
Once a protein ligand has been identified, the ligand-encoding polynucleotides may be obtained 
by expression cloning (Wong, Genetic Engineering, ed. by J.K. Setlow, 1990, Plenum Press, 
New York, vol. 12. p. 297) or by synthetic means. 

The mlk proteins of the invention may also be used to screen for agents which are 

11 
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capable of binding lo mik protein (either the extracellular or intracellular domains) and thus may 
act as inhibitors of normal Hgand binding. Binding assays using a desired binding protein, 
immobihzed or not. are well known in the art and may be used for this purpose using the mlk 
protein of the invention. Purified cell based or protein based (cell free) screening assays may 

5 be used to identify such agents. For example, mlk protein may be immobilized in purified form 
on a carrier and binding to purified mlk protein may be measured in the presence and in the 
absence of potential inhibiting agents. A suitable binding assay may alternatively employ 
purified mlk protein immobilized on a carrier, with a soluble form of mlk protein of the 
invention. Any mlk protein exhibiting mlk ligand binding activity may be used in the screening 

10 assays described above. 

In such a screening assay, a first binding mixture is formed by combining a mlk 
binding ligand and mlk protein, and the amount of binding in the first binding mixture (B^) is 
measured. A second binding mixture is also formed by combining a mlk binding ligand, mlk 
protein, and the compound or agent to be screened, and the amount of binding in the second 

15 binding mixture (B) is measured. The amounts of binding in the first and second binding 
mixtures are compared, for example, by performing a B/B^ calculation. A compound or agent 
is considered to be capable of inhibiting binding if a decrease in binding in the second binding 
mixture as compared to the first binding mixnire is observed. The formulation and optitnization 
of binding mixtures is within the level of skill in the art, such binding mixtures may also contain 

20 buffers and salts necessary to enhance or to optimize binding, and additional control assays may 
be included in the screening assay of the invention. 

Compounds found to reduce the binding activity of mlk protein to ligand to any 
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degree, preferably by at least about 10%, more preferably greater than about 50% or more, may 
thus be identified and then secondarily screened in other binding assays and in vivo assays. By 
these means compounds having inhibitory activity for mlk binding which may be suitable as 
therapeutic agents may be identified. 

It is believed that the mlk receptor plays an important role in skeletal muscle 
function. As described below, mlk expression is localized in skeletal muscle, spleen and limg 
(see Northern blot reproduced in Fig. 3). Furthermore, murine mlk maps to the proximal arm 
of murine chromosome No. 4. A mutant murine strain mapping in this proximity, "vacillans" 
(vc), is characterized by half normal muscle mass, gait problems, lack of muscle coordination, 
slowed behavior and reflexes, metabolic abnormalities, mental deficiencies and drastic reduction 
of body fat (see, Sirlin, J. Genetics, 1956, 54, 42). By synteny, it is believed that human mlk 
will map to the p or q arm of human chromosome 9. It has been reported that a condition 
denominated Fukuyama congenital muscular dystrophy (TCMD") maps to human chromosome 
9q31>33 (Toda et ah. Nature Genetics, 1993, 5, 283), the expected locus of human mlk. FCMD 
is characterized by involvement of facial muscle and the central nervous system, brain 
malformation, progressive muscular dystrophy, mental retardation, autosomally recessive 
inheritance and joint conu-actures (Osawa el al.. Acta Paediatr. Jpn., 1991, 33, 261). Familial 
dysautonomia ("DYS"), or Riley-Day syndrome, has also been mapped to human chromosome 
9q31-33 (Blumenfeld et al.. Nature Genetics, 1993, 4, 160). This hereditary disease affects the 
development and survival of sensory, sympathetic and some parasympathetic neurons, DYS is 
characterized by neurological deficiency, with a range of other features including skeletal muscle 
abnormality. The similarity between the mlk receptor and the Torpedo RTK also points toward 

13 
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muscular involvement, since the Torpedo RTK was isolated from the electric organ of electric 
rays which is homologous to muscle tissue. Each of these findings implicates mlk receptor in 
muscle biology, perhaps playing a role in synapse formation and function. 

As a result, isolated mlk proteins and mlk receptor ligands may be useful in 
5 treatment of various medical conditions in which the mlk receptor is implicated or which are 
effected by the activity (or lack thereof) of the mlk receptor (collectively ' mlk-related 
conditions"). Mlk-related conditions include without limitation muscle-related disorders, diseases 
of the nervous system (including infections), vascular disorders, trauma, metabolic 
derangements, demyelinating diseases (including multiple schlerosis), neuronal diseases 

10 (including Alzheimer's disease, Parkinson s disease and Huntington's chorea: and including 
motor neuron diseases such as amyotrophic lateral sclerosis, primary lateral sclerosis and 
Werdnig-Hoffmann disease), epilepsy, syringomyelia, peripheral neuropathy, congenital 
anomalies and tumors. Muscle*related conditons for treatment include without limitation 
muscular dystrophies (such as severe and benign X-linked muscular dystrophy, limb-girdle 

15 dystrophy, facioscapulohumeral dystrohpy, myotonic dystrophy, distal muscular dystrophy, 
progressive dystrophic ophthalmoplegia, oculopharyngeal dystrophy and FCMD), DYS, 
congenital myopathy, myotonia congenita, familial periodic paralysis, paroxysmal 
myoglobinuria, myasthenia gravis, Eaton-Lambert syndrome, secondary myasthenia, and 
denervation atrophy. Given the importance of mlk RTK in muscle, perhaps especially in the 

20 establishment and function of synapses, it is also believed that the mlk protein and ligand will 
be of value in tissue repair. 

It is also believed that the mlk receptor may play a role in the development and 

14 
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growth of hematopoietic cells. As a result, mlk proteins and mlk receptor ligands may be useful 
in treatment of suppressed hematopoiesis^ e.g., as a result of chemotherapy or associated with 
bone marrow transplantation. In addition, they may be usefixl for in vitro and in vivo stimulation 
of hematopoietic cells prior to transplantation, perhaps in peripheral stem cell transplantation. 
Furthermore, they may act to modulate the differentiation and maturation of hematopoietic cells, 
including lymphocytes. Thus, they may be of value, for example, in stimulation of antigen- 
specific cytotoxic T-cell action. 

In situ hybridization experiments have also detected expression of mlk in fetal 
bone. As a result, mlk proteins or mlk receptor ligands may also be useful for treatment of 
various bone-related disorders, including without limitation those associated with bone loss 
(including that associated with osteoporosis, post-menopausal osteoporosis, senile osteoporosis, 
idiopathic osteoporosis, Pagets disease, multipe myeloma, and hypogonadal comlitions). mlk 
proteins or mlk receptor ligands may also be useful as stimulators or inhibitors of osteoclasts and 
other cells involved in bone metabolism. 

The mlk proteins or mlk receptor ligands may also be used to promote the growth 
of bone and cartilage and, therefore, have application in the healing of. bone fracmres and 
cartilage defects in humans and other animals. Such agents may have prophylactic use in closed 
as well as open fracture reduction and also in the improved fixation of artificial joints. De novo 
bone formation induced by such an agent contributes to the repair of congenital, trauma induced, 
or oncologic resection induced craniofacial defects, and also is useful in cosmetic plastic surger>'. 
These agents may also be used in the treatment of periodontal disease, and in other tooth repair 
processes. Such agents may provide an environment to attract bone-forming cells, stimulate 

15 
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growth of bone-forming cells or induce differentiation of progenitors of bone-forming cells. A 
variety of osteogenic, cartilage-inducing and bone inducing factors have been described. See. 
e.g., European patent applications 148,155 and 169,016 for discussions thereof. 

The proteins of the invention may also be used in wound healing and related tissue 
5 repair. The types of wounds include, but are not limited to bums, incisions and ulcers. (See. 
e.g. PCT Publication WO84/01106 for discussion of wound healing and related tissue repair). 

Isolated mlk proteins and mlk receptor ligands, purified from cells or 
recombinantly produced, may be used as a pharmaceutical composition when combined with a 
pharmaceutically acceptable carrier. Such a composition may contain, in addition to mlk protein 

10 or ligand and carrier, diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials 
well known in the art. The term "pharmaceutically acceptable*' means a non-toxic niaterial that 
does not interfere with the effectiveness of the biological activity of the active ingredient(s). The 
characteristics of the carrier will depend on the route of administration. The pharmaceutical 
composition of the invention may also contain cytokines, lymphokines, or other hematopoietic 

15 factors such as M-CSF, GM-CSF, IL-l, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9. IL-IO. 
in I, IH2, IH5, G-CSF, stem cell factor, and erythropoietin. The pharmaceutical 
composition may contain thrombolytic or anti-thrombotic factors such as plasminogen activator 
and Factor VIII. The pharmaceutical composition may further contain other anti-infIammator\ 
agents. Such additional factors and/or agents may be included in the pharmaceutical composition 

20 to produce a synergistic effect with isolated mlk protein or ligand, or to minimize side effects 
caused by the isolated mlk protein or ligand. Conversely, isolated mlk protein may be included 
in formulations of the particular cytokine, lymphokine. other hematopoietic factor, thrombolytic 
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or anti-thrombotic factor, or anti-inflammatory agent to minimize side effects of the cytokine, 
lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti- 
inflammatory agent. 

The pharmaceutical composition of the invention may be in the form of a liposome 
in which isolated mlk protein or ligand is combined^ in addition to other pharmaceutically 
acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers which in aqueous solution. 
Suitable lipids for liposomal formulation include, without limitation, monoglycerides, 
diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, and the like. 
Preparation of such liposomal formulations is within the level of skill in the art, as disclosed, 
for example, in U.S. Patent No. 4,235,871; U.S. Patent No. 4,501,728; U.S. Patent No. 
4,837,028; and U.S. Patent No. 4,737,323, all of which aie incorporated herein by reference. 

As used herein, the term "therapeutically effective amount" means the total 
amount of each active component of the pharmaceutical composition or method that is sufficient 
to show a meaningful patient benefit, e.g., amelioration of symptoms of, healing of, or increase 
in rate of healing of such conditions. When applied to an individual active ingredient, 
administered alone, the term refers to that ingredient alone. When applied to a combination, the 
term refers to combined amounts of the active ingredients that result in the therapeutic effect, 
whether administered in combination, serially or simultaneously. 

In practicing the method of treaunent or use of the present invention, a 
therapeutically effective amount of isolated mlk protein or ligand is administered to a mammal. 
Isolated mlk protein or ligand may be administered in accordance with the method of the 
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invemion either alone or in combination with other therapies such as treatments employing 
cytokines, lymphokines or other hematopoietic factors. When co-administered with one or more 
cytokines, lymphokines or other hematopoietic factors, isolated mlk protein or ligand may be 
administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic 
5 factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If administered sequentially, 
the attending physician will decide on the appropriate sequence of administering isolated mlk 
protein or ligand in combination with cytokine(s). lymphokine(s), other hematopoietic factor(s). 
thrombolytic or anti-thrombotic factors. 

Administration of isolated mlk protein or ligand used in the pharmaceutical 

10 composition or to practice the method of the present invention can be carried out in a variety 
of conventional ways, such as oral ingestion, inhalation, or cutaneous, subcutaneous* or 
intravenous injection. Intravenous administration to the patient is preferred. 

When a therapeutically effective amount of isolated mlk protein or ligand is 
administered orally, isolated mlk protein or ligand will be in the form of a tablet, capsule, 

15 powder, solution or elixir. When administered in tablet form, the pharmaceutical composition 
of the invention may additionally contain a solid carrier such as a gelatin or an adjuvant. The 
tablet, capsule, and powder contain from about 5 to 95% isolated mlk protein or ligand. and 
preferably from about 25 to 90% isolated mlk protein or ligand. When administered in liquid 
form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil. 

20 mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the 
pharmaceutical composition may further contain physiological saline solution, dextrose or other 
saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. 
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When administered in liquid form, the pharmaceutical composition contains from about 0.5 to 
90% by weight of isolated mik protein or ligand, and preferably from about I to 50% isolated 
mlk protein or ligand. 

When a therapeutically effective amount of isolated mlk protein or ligand is 
administered by intravenous, cutaneous or subcutaneous injection, isolated mlk protein or ligand 
will be in the form of a pyrogen-free, parenteral ly acceptable aqueous solution. The preparation 
of such parenterally acceptable protein solutions, having due regard to pH. isoionicity, stability, 
and the like, is within the skill in the art. A preferred pharmaceutical composition for 
inu-avenous, cutaneous, or subcutaneous injection should contain, in addition to isolated mlk 
protein an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, Dextrose 
Injection,. Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or other vehicle 
as known in the art. The pharmaceutical composition of the present invention may also contain 
stabilizers, preservatives, buffers, antioxidants, or other additive known to those of skill in the 
art. 

The amount of isolated mlk protein or ligand in the pharmaceutical composition 
of the present invention will depend upon the nature and severity of the condition being treated, 
and on the nature of prior treatments which the patient has undergone. Ultimately, the attending 
physician will decide the amount of isolated mlk protein or ligand with which to treat each 
individual patient. Initially, the attending physician will administer low doses of isolated mlk 
protein or ligand and observe the patient's response. Larger doses of isolated mlk protein or 
ligand may be administered until the optimal therapeutic effect is obtained for the patient, and 
at that point the dosage is not generally increased ftjrther. It is contemplated that the various 
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pharmaceutical compositions used to practice the method of the present invention should contain 
about 0.1 fig to about 100 mg of isolated mlk protein or ligand per kg body weight. 

The duration of intravenous therapy using the pharmaceutical composition of the 
present invention will vary, depending on the severity of the disease being treated and the 
condition and potential idiosyncratic response of each individual patient. It is contemplated that 
the duration of each application of the isolated mlk protein or ligand will be in the range of 12 
to 24 hours of continuous intravenous administration. Ultimately the attending physician will 
decide on the appropriate duration of intravenous therapy using the pharmaceutical composition 
of the present invention. 

Isolated mlk protein of the invention may also be used to immunize animals to 
obtain polyclonal and monoclonal antibodies which specifically react with the mlk protein and 
which may inhibit ligand binding to the mlk receptor. Such antibodies may be obtained using 
the entire mlk protein as an immtmogen, or by using fragments of mlk protein such as ttie 
soluble mature mlk protein. Smaller fragments of the mlk protein may also be used to inununize 
animals. The peptide immunogens additionally may contain a cysteine residue at the carboxyl 
terminus, and are conjugated to a hapten such as keyhole limpet hemocyanin (KLH). Additional 
peptide immunogens may be generated by replacing tyrosine residues with sulfated tyrosine 
residues. Methods for synthesizing such peptides are known in the art, for example, a& in R.P. 
Merrifield, J.Amer.Chem.Soc. §5. 2149-2154 (1963); J.L. Krstenansky, et al., FEBS Leu. 211^ 
10 (1987). 

Neutralizing or non-neutralizing antibodies (preferably monoclonal antibodies) 
binding to mlk protein or mlk ligand may also be useful therapeutics for certain tumors and also 
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in the treatment of conditions described above. These neutralizing monoclonal antibodies are 
capable of blocking the ligand binding to the mlk protein. 



Example 1 
Isolation of Murine mlk cDNA 
A fragment of the murine mlk cDN A was isolated using PCR amplification of first 
suand cDNA with primers specific to conserved motifs of receptor tyrosine kinases. Bone 
marrow from C57BL/6J mice treated two days previously with 150 mg/kg 5-fluorouracil was 
depleted of 0220+, Grl + , CD3+ and Macl-h cells, and then the cells bearing the Sca + 
antigen were isolated by fluorescence-activated cell sorting (FACS). Total cellular RNA was 
purified from the sorted cells using the acid phenol technique. 

First strand cDNA was synthesized using random hexamer primers and Moloney 
murine leukemia virus reverse transcriptase (Perkin Ebner GeneAmp RNA PCR reagents). The 
PCR scheme used to isolate the murine cDNA is summarized in Fig, 2. Hie cDNA was 
amplified using the primers of SEQ ID NO:3 and SEQ ID N0:4. The SEQ ID N0:3 and SEQ 
ID NO:4 primers were directed to the IHRDL and SDVWS motifs, respectively. The reaction 
was carried out in a Perkin Elmer 9600 Themial Cycler, using the following conditions: 94 °C 
for 30 sec, 37°C for 15 sec, 72^C for 15 sec (5 cycles): 94^C for 30 sec, SO^'C for 15 
sec, 72**C for 15 sec (25 cycles). 

The product of this amplification was purified by Ccntricon 30 microcentrifugation 
(Amicon), and then amplified again using the primers of SEQ ID N0:5 and SEQ ID NO:6, 
directed to the DLAAR and WLALES motifs, respectively. The reaction was carried out under 
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the conditions described above. 

The second primers contained EcoRI sites at their 5' ends to facilitate sub-cloning. 
Thus, the products of the second reaction were digested with EcoRI, and examined by gel 
electrophoresis using low melting temperature agarose and ethidium bromide staining, the 
S approximately ISO bp band expected was excised and eluted. The isolated fragments were 
ligated into EcoRI digested pBluescript KSI1+ (Stratagene), and used to transform competent 
InvaF' E. coli (Invitrogen). Transformants were selected on L-amp plates containing 100 ^g/m] 
ampicillin. and then randomly selected for miniplasmid preparations and analysis by nucleotide 
sequencing. 

10 The motifs of the first PCR are highly conserved in protein tyrosine kinases 

(Hanks and Quinn, Methods in EnzymoL, 1991, 200, 38) and this amplification selects for a 
population of cDNAs derived from cytosolic and receptor tyrosine kinases. The motifs of the 
second PCR are specific to RTKs. Analysis of products of the second PCR amplification 
showed this strategy to be successful. No cytosolic tyrosine kinases were obtained. However, 

15 previously described RTKs, such as flk-1, flk-2 and c-kit. were found. The sequence of one 
iransformant was found to be novel and not previously described. 

Longer cDNA clones of this novel sequence, including pred2#2 (SEQ ID NO:l), 
were obtained from a murine embryo (day 15) cDNA library in the lambda SHlox vector 
obtained from Novagen (Madison, WI), This library was hybridized with an oligonucleotide 

20 probe derived from the novel transformani (SEQ ID N0;7), which was radiolabelled with 
gamma-^-P-ATP. pred2#2 was deposited as an EcoRI/Hindlll insen in pSHlox (Novagen, 
Wisconsin) with ATCC on July 20. 1994 as accession number 69659. 
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Example 2 
Localization of mlk Expression 
Expression of mlk mRNA is examined by Northern blot hybridization analysis of 
RNA isolated from primary tissues and cell lines. Cellular RNA is isolated by conventional 
techniques such as acid phenol isolation, guanidinium thiocyanate denaturation/density gradient 
centrifugation in cesium chloride or RNA Stat-60 isolation (Tel-Test Inc., TX). The mRNA 
fraction is selected on the basis of binding to oligo dT,2.i8, using either oligo dT-cellulose 
chromatography, or binding to oligo dT-biotin and then magnetic separation after binding to 
streptavidin-paramagnetic particles (Promega, Inc., WI). RNA is exposed to formaldehyde- 
agarose gel electrophoresis and then transferred to nitrocellulose or nylon membranes by 
capillary blotting. Northern blot filters of murine and human tissue RNAs can also be purchased 
from Clontech, CA. 

The first northern blot analysis of murine tissue RNAs was carried out with a 
murine mlk cDNA probe labelled with a-^P^CTP by random hexamer priming and the Klenow 
fragment of £. coli DNA polymerase I. Two mRNA species were found to hybridize to this 
probe, of approximately 7.0 and 4.0 kb. Both mlk mRNAs were detected in spleen and lung 
poly A* RNA, and at higher levels in skeletal muscle poly A* RNA. Subsequent northern blot 
experiments have not detected expression of mlk mRNAs in spleen or lung tissue and have 
confirmed high levels of mlk expression in skeletal muscle. 

Analysis of mlk mRNA expression by northern blot in murine embryo poly A" RNA 
revealed that mlk is expressed at least as early as day 7 after fertilization. 

Expression of mlk mRNA is also examined in tissue sections or whole mouni 
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embryos by in situ hybridization with radiolabeiled or chemically modified RNA probes. 
Antisera against mlk protein are generated by immunization of rabbits, mice, hamsters, goats 
or sheep with synthetic mlk peptides, such as the C-terminal 12-20 residue peptide conjugated 
to carrier. Antibodies against the extracellular portion of mlk are obtained by immunization with 
recombinant fusion proteins such as mlk-thioredoxin, mlk-maltose-binding protein, mlk- 
glutathione-S-transferase. These antibodies are used to localize mlk protein by 
immunohistochemistry. Antibodies against the extracellular region of mlk are used to isolate 
mlk expressing cells, e,g,, from bone marrow or muscle, by fluorescence-activated cell soaing. 
in conjunction with a secondary antibody that recognizes the primary anti-mlk, and is conjugated 
to a fluorochrome such as phycoerythrin or fluorescein isothiocyanate. Using additional 
antibodies conjugated to fluorochromes against known cell surface markers in conjunction with 
the ariti-mlk protein will permit a phenotyping of the subsets of cells expressing mlk. 

The expression of mlk polypeptide is also examined by immunopi^ipitation and 
in vitro kinase assay. Cell or tissue extracts are prepared in 1 % (v/v) Nonidet P-40, 10% (v/v) 
glycerol^ 150 mM sodium chloride, 20 mM Tris-HCl pH 8.0, or alternatively in 1% (v/v) 
Nonidet P-40, 1 % (w/v) sodium deoxycholate, 0.1% (w/v) SDS, 25 mM Tris-HCl pH 8.0, 150 
mM sodium chloride. Both solutions are supplemented with 1 mM sodium vanadate. 1 ^e/ml 
leupeptin, 1 mM Pefabloc SC, 1 figfml aproiinin. Antisera raised against the C-ierminal mlk 
peptide and protein A-Sepharosc are then used to precipitate the mlk polypeptide. The 
immunoprecipiiaie is washed three times in the above extraction buffer, then three times in 
kinase assay buffer (20 mM HEPES, pH 7.3, 10 mM manganese chloride, 5 mM magnesium 
chloride) and then resuspended in 50 m1 kinase assay buffer containing 2 ^1 of 7-^-P-ATP (10 
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/xCi/Ml; 1.000 Ci/nmiol). After incubation for 15 minutes, the immunoprecipiute is washed 
three times in 20 mM HEPES. pH 7.3. 5 mM ethylene glycol-bis(/3-amino-ethyI-cther) 
N.N.N'.N'-tetraacetic acid (EGTA). ImM rATP. and then boiled for 5 minutes in 1 % SDS. 100 
mM Tris-HCI pH 6.8, 50 ^il/ml /3-mercaptocthanol. prior to SDS-PAGE electrophoresis and 
autoradiography.. 

Examnle "K 
Chromosom al Mapping of Murine mlk 
Murine mlk was chromosomaliy mapped by single-strand conformation 
polymorphism analysis (Beier. Mammalian Genome. 1993. 4. 627) using the following 
oligonucleotides: 

IF: CAGACTGTGAGCTGGAGGAAC (SEQ ID NO:8) 

IR: GGATTATGTAGGAACGTAACC (SEQ ID NO:9) 

2F: CGAAATAGGTTGGAGATACAGG (SEQ ID NO: 10) 

2R: CGTGTCTTGACTGGTGGAAGAAGG (SEQ ID NO: II). 

The murine mlk gene was mapped genetically by SSCP analysis in the BxD 
recombinam inbred (Rl) series. The murine mlk gene was mapped to the proximal arm of 
murine chromosome 4. By synieny. this region is predicted to correspond ta human 
chromosome 9, p or q. 

Example 4 
Assay for mlk Receptor Activity 
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Celis expressing mlk protein are used in tyrosine phosphorylation assays for mik 
receptor activity. NIH 3T3 or Rai2 cells are stably transfected with mlk expression plasmids. 
For 12-24 hours prior to the assay, the cells are maintained under standard growth conditions, 
except that onJy 0.5% fetal bovine serum is used in the culture medium. Monolayers of these 
cells, or suspensions prepared by detachment of cells by brief treatment with 20 mM EGTA in 
PBS, are stimulated for 5 min. at 37**C with ligand or antibodies directed to the extracellular 
region. Cells are then pelleted, and solubilized in either ice-cold 1% (v/v) Nonidet P-40, 1% 
(w/v) sodium deoxycholate, 0.1% (w/v) SDS, 25 mM Tris-HCl pH 8.0, 150 mM sodium 
chloride, or ice-cold 1% (v/v) Nonidet P-40, 10% (v/v) glycerol. 20 mM Tris-HCl pH 8.0, 150 
mM sodium chloride. Both extraction buffers are supplemented with 1 mM sodium vanadate. 
1 Mg/ml leupepttn, 1 ^g/ml aprotinin, 1 mM Pefabloc SC. 

The solubilized proteins are then immunoprecipitated with rabbit antibodies 
directed against the mlk protein, such as the C-tenninal peptide epitope, followed by protein A- 
sepharose. The immunoprecipitated material is then exposed to reducing SDS-PAGE 
electrophoresis and transferred to a nitrocellulose filter by Western blotting. 

The amount of phosphotyrosine in the immunoprecipitated mlk protein is then 
assayed by incubation of the filter with a monoclonal antibody specific for phosphotyrosine (e.g. . 
the 4G10 monoclonal antibody available from Upstate Biotechnology, New York). The antibody 
is either directly conjugated to horseradish peroxidase, or used as a biotin conjugate, followed 
by incubation with strepiavidin-horseradish peroxidase. Binding is visualized with enhanced 
chemiluminescence reagents (commercially available from Amersham, Illinois) and exposure to 
x-ray film. 
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An increase in phosphotyrosine in samples treated (e.g., with a potential ligand 
source) relative lo samples treated only with fresh medium, are taken to indicate induction of 
receptor activity. Positive samples can be further studied by examining the time course of 
phosphotyrosine induction on the mlk polypeptide. 

Induction of phosphotyrosine on mlk protein can also be examined by 
immunoprecipitation with anti-phosphotyrosine conjugated to agarose, followed by Western 
blotting with anti-phosphotyrosine, or Western blotting of whole cell extracts with ami- 
phosphotyrosine. 

Techniques for the study of phosphotyrosine on proteins have been described 
(Cooper et al.. Methods in Enzymol., 1983 , 99, 387). 

Example 5 
Assay for mlk Lipand Binding Activity 

Binding of the mlk ligand to the extracellular region of the mlk receptor protein 
will specifically cause a rapid induaion of phosphotyrosine on the receptor protein. Assays for 
ligand binding activity as measured by induction of phosphorylation is described in Example 4. 

Activation of the receptor by ligand binding can also be determined by measuring 
cell growth or differentiation. For example, NIH 3T3 or Rat2 fibroblasts are stably iransfecied 
with expression vectors directing synthesis of mlk protein. A mitogenic response in these 
iransfected cells as a result of ligand binding is measured by examining their growth in semi- 
solid medium, as described for other RTKs expressed in transfected NIH 3T3 or Rat2 cells 
(Cordon-Cardo et al.. Cell. 1991. 66173; Glass el al.. Cell. 1991, 66. 405: Dosil et al.. Mol. 
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Cell Biol., 1993, 13. 6572; Maroc ei al.. Oncogene, 1993, 8, 909). In the presence of binding 
ligand, these cells will acquire the ability to grow in semi-solid media. 

Induction of growth can also be measured by examining c-fos induction. The 3T3 
or Rat2 cells are placed in 0.5% fetal bovine serum in DMEM for 24 hours prior to ligand 
addition. Addition of ligand or serum (positive control) will cause an induction of expression 
of the c-fos inunediate response gene. Samples are harvested at zero time (prior to ligand 
addition), 30 min., 1 hour, 2 hours and 3 hours after ligand or senim addition. Cells harvested 
are used to isolate total cellular RNA, which is then examined by Northern blot analysis with 
a c-jfos cDNA probe. 

Alternatively, a soluble form of the mlk extracellular region is produced and used 
to detect ligand binding. A DNA construct is prepared in which the extracellular region 
encoding the mlk cDNA (truncated prior, preferably immediately prior, to the predicted 
transmembrane domain) is ligated in frame to a cDNA encoding the hinge Ch2 and C„3 domains 
of a human immunoglobulin (Ig) 7I . This construct is generated in an appropriate expression 
vector for COS cells, such as pEDAC or pMT2. The plasmid is transiently transfected into COS 
cells. The secreted mlk-Ig fusion protein is collected in the conditioned medium and purified 
by protein A chromatography. 

The purified mlk-Ig fusion protein is used to demonstrate ligand binding in a 
number of applications. The soluble ligand can be coated onto the surface of an enzyme-linked 
immunosorbent assay (ELISA) plate, and then additional binding sites blocked with bovine serum 
albumin or casein using standard ELISA buffers. The mik-lg fusion protein is then bound to 
the solid-phase ligand, and binding is detected with a secondary goat anti-human Ig conjugated 
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10 horseradish peroxidase. The activity of specifically bound enzyme can be measured with a 
colorimetric substrate, such as tetramethyl benzidine and absorbance readings. 

The mlk ligand may also be expressed on the surface of cells, for example b\ 
possessing a transmembrane domain or glucosyl phosphatidyl inositol (GPI) linkage. Cells 
expressing the membrane bound ligand can be identified using the mlk-Ig fusion protein. The 
soluble mlk-Ig fusion is bound to the surface of these cells and detected with goat anti-human 
Ig conjugated to a fluorochrome, such as fluorescein isothiocyanate and flow cytometr>'. 

Other mlk proteins may be tested for mlk ligand binding activity by forming 
expression of such other proteins as Ig fusion proteins as described herein. 

Example 6 

Isolation of Full-Length Murine mlk cDNA 

The 5' end of the murine mlk can be isolated from random hexamer primed 
cDNA libraries. Murine skeletal muscle or whole embryo (day 15 post-fertilization) RNA is 
prepared and used to synthesize double-stranded cDNA. Priming of first strand cDNA with 
random hexamers rather than oligo dT,2.i8 will ensure the representation of the 5' end of mlk in 
the librar>'. The cDNA can be ligated with Eco RI adaptors and cloned into an EcoRI digested 
bacteriophage X vector such as X ZIPLOX (Gibco-BRL, MD) or X ZAPII (Stratagene, CA). 
The resulting plaques can be screened with a radiolabelled synthetic oligonucleotide probe, 
derived from the 5' end of the available panial cDNA sequence. Alternatively, the library can 
be screened with a 300-400 bp restriction fragment from the 5' end of the available cDNA. 

A murine Balb/c strain skeletal muscle cDNA library (oligo dT and random 
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hexamer primed) in Xgt 10 was used (purchased from Clontech, CA: catalog number ML 
3006a). A 5' fragment of pred2#2 was isolated using PCR with primers: 
5' ACACTGCGTGGAATGAGCTGA 3^ (SEQ ID NO: 16) and 

5' AAATATGGCAGTCTTGTGCA 3* (SED ID NO: 17). This isolated PCR fragment was 
labelled with a-^^P-dCTP and the Klenow fragment of DNA polymerase I with random hexamer 
priming. The labelled fragment was used to screen the Xgt 10 skeletal muscle library by 
hybridization at high stringency. Single hybridizing bacteriophage were purified. Lambda DNA 
was purified with Wizard Lambda reagents (Promega, WI), and digested with EcoRl. An EcoRI 
insert fragment was subcloned into pBluescriptllKS + (Stratagene, CA), and examined by 
nucleotide sequencing. A cDNA clone called "5'mlk #3" was identified by this method. The 
nucleotide sequence of 5'mlk#3 is provided as SEQ ID NO: 14. The amino acid sequence 
encoded thereby is provided as SEQ ID NO: 15. 

5'nilk#3 was deposited as an EcoRI insert in pBIuescriptIIKS+ with ATCC on 
January 11, 1995 as accession number 69741. 

5'mlk#3 is a partial cDNA for the full length mlk RTK which includes the 5' 
end. The 5'mlk#3 sequence and previous red2#2 sequence overlap in a region of the 
extracellular region in juxtaposition to the transmembrane domain. The overlap region mns 
from about amino acids 33M78 of SEQ ID NO: 15 (5'mlk#3) and from about amino acids 1-140 
of SEQ ID NO:2 (red2#2). 

Alignment of the sequences reveals a 24 bp nucleotide insertion of 5 'mlk#3 that 
is not present in red2#2. This insert encodes an additional 8 amino acids in frame (amino acids 
454-461 of SEQ ID NO: 15). The first residue after the insert (amino acid 462 of SEQ ID 
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NO: 15) also differs from that of redlffl (amino acid 124 of SEQ ID NO:2). Thus, there are two 
isoforms of the mlk RTK, which are designated as "mlk-1" and "mlk-2", which differ by not 
containing or containing this insert, respectively. Similar isoforms of the KIT and TRK A RTKs 
have been described in the literature (Crosier ei al.. Blood, 1993, 82, 1151; Reith et al., EMBO 
J., 1991, 10, 2451; Barker et aL, J. Biol. Chem., 1993 , 268, 15150)). Differences in the 
isoforms may relate to ligand binding, receptor dimerization or activation. 

Full-length cDNAs for both isoforms are made by ligating ponions of the 5'mlk#3 
and red2#2 sequences. Briefly, both cDNAs are cleaved at the same point in the region of the 
sequence overlap with an appropriate restriction enzyme. The 3* end of the 5'mlk#3 fragment 
containing the 5' end of the entire mlk coding sequence is then ligated to the 5' end of the 
red2#2 fragment containing the 3* end of the entire mlk coding sequence. 

The nucleotide and amino acid sequences for full-length mlk-1 isoform are 
provided at SEQ ID NO: 18 and SEQ ID N0:19, respectively. The nucleotide and amino acid 
sequences for luil-length mlk-2 isoform are provided at SEQ ID NO;20 and SEQ ID N0:21, 
respectively. In SEQ ID N0;21 (mlk-2), the 8 amino acid insert is found at amino acids 454- 
461 and amino acid 462 is a threonine residue (the corresponding residue in SEQ ID NO: 19 
(mlk-1) is alanine). 

Table 1 identified the locations of the mlk protein domains in various forms of 
mlk protein identified herein. Domains are identified by reference to amino acid numbers in the 
indicated sequences. The location of the leader sequence aand transmembrane domains was 
predicted using computer sequence analysis algorithms (leader sequence: Von Heijne, 1987. 
Sequence Analysis in Molecular Bioilogy: Treasure Trove or Trivial Pursuit (New York: 
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Academic Press): transmembrane domain: Klein ei al., 1985, Biochim, Biophys. Acta. 815, 
468). 



Table 1 

Identification of mlk Protein Domains 





SEQ ID N0:2 


SEQ ID NO: 15 


SEQ ID NO: 19 


SEQ ID N0:21 


leader 


N/A 


1-21 


1-21 


1-21 


extracellular 


1-156 


22-478 


22-486 


22-494 


transmembrane 


157-177 


N/A 


487-507 


495-515 


intracellular 


178-530 


N/A 


508-860 


516-868 


soluble 


1-125; 1-156 


N/A 


22-453; 22-486 


22-462; 22-494 


tyrosine kinase 


242-517 


N/A 


572-847 


580-855 



Example 7 
Isolation of Human mlk cDNA 
The human mlk cDNA can be isolated by cross-hybridization with the murine mlk 
cDNA. There is a high degree of cross-species sequence conservation between RTK genes, 
especially in the region encoding the kinase domain. An example is the similarity between 
murine mlk and the Torpedo RTK cDNA sequences (see Fig. 1). The Torpedo RTK nucleotide 
and amino acid sequences are reponed as SEQ ID N0:12 and SEQ ID N0:I3. respectively. 
Human cDNA libraries in appropriate bacteriophage X vectors such as \ ZAPII can be 
hybridized with mlk cDNA probes. These libraries can be purchased from Stratagene, 
California and Clontech, California. The commercially available libraries include human muscle 
cDNA libraries that would be expected to be a good source. If necessary, the complete human 
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mlk gene can be isolated from human genomic libraries. Such genomic clones would also 
provide ideal probes for screening human cDNA libraries at high stringency. 

The human cDN A libraries are hybridized with a high specific activity ^^P-labelled 
mlk cDNA probe at low stringency (6 x SSC, 0.1% SDS; 45^C) for 36-48 hours using 
5 approximately 2 x 10* cpm/ml of probe. The plaque DNA is immobilized on nitrocellulose 
filters, with approximately 20,000 plaques per 137 mm plate. After the hybridization, the filters 
are washed in four changes of 4 X SSC, 0. 1 % SDS at room temperanire over 90 minutes, with 
gentle rotation. Then, the filters are placed in 4 X SSC, 0.1% SDS at 55**C for 30 minutes. 
The filters are air dried and exposed to x-ray film with two intensifying screens at -SO^C, for 

10 24-72 hours. Positive plaques are isolated, re-plated and screened by hybridization until a 
single, pure recombinant bacteriophage plaque that hybridizes to the probe is isolated, and then 
analyzed by nucleotide sequencing. 

The stringency of the hybridization and filter washing may be increased or 
decreased to optimize signal detection and signal to background ratio. The exact conditions for 

IS cross-hybridization of murine mlk cDNA annealing to the human mlk sequence may also be 
refined by examining the hybridization of the radiolabelled mlk probe to human genomic DNA 
Southern blots. Identical Southern blot filters containing Eco RI and Hind III digested human 
genomic DNA are hybridized at 5 X SSC, 0.1% SDS at 45*'C, and then washed in 5 X SSC, 
0, 1 % SDS at room temperature for 90 minutes. A range of different washing stringencies are 

20 then employed over a range of SSC concentrations and temperature conditions. Each wash 
includes 0.1% SDS, and is carried out for 30 minutes (two changes). In this way, optimal 
conditions can be found, and then used for cDNA library screening. 
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An alternative way of isolating the human mlk cDNA is to amplify a ponion of 
it with degenei^le primers based on homologies inferred from alignment of murine mlk and 
Torpedo RTK. The template for this PCR amplification is first strand cDNA synthesized from 
human poly A* RNA (purchased from Clontech. CA) bacteriophage \ cDNA libraries or human 
genomic DNA (Promega. WI). The resulting PCR product is cloned and sequenced, and then 
represents a probe for hybridizing human cDNA and genomic libraries at high stringency. The 
unique sequence can also be used to isolate the remainder of the human mlk cDNA by 5" or 3' 
anchored PCR (Loh et al.. Science, 1989. 243. 217; Ohara et al.. Proc. Natl. Acad. Sci. USA. 
1989, 86, 5673; reviewed in Amheim et al.. Annu. Rev. Biochem.. 1992, 131). 

Example 8 

Expressi on of Murine and Human mlk cDNA 
The mlk-1 cDNA is ligated into the pcDNA3 expression vector anvitrogen. CA). 
such that the mlk coding region was under the control of the CMV promoter. This vector also 
contains an SV40 origin, and can be used as a replicating vector in COS cells. COS cells are 
transienUy transfecied with pcDNA3/mlk-l or as negative controls: an expression vector for Flt4 
or no DNA (" mock"). 

After 48 hours, cell lysates are prepared in ice-cold 1 % (v/v) Nonidet P-40, 20 
mM Tris-HCI pH 8.0. 150 mM sodiumchloride. 10% (v/v) glycerol, 1 mM sodium vanadate. 
I /ig/ml leupeptin, 1 mM Pefabloc SC, 1 /xg/ml aprotinin. and then exposed to reducing SDS- 
PAGE, and transferred to a nitrocellulose filter by western blotting. The filter is blocked with 
5% (w/v) non-fat dried milk. 0.1% Tween 20 in TBS (20mM Tris-HCI pH 7.5. 150 mM .sodium 
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chloride) overnight at 4**C. The filter is washed in TBS/0.3% Tween 20, and incubated for 1 
hr. at ambient temperature with rabbit aniisera against the C-terminal mlk peptide. Then the 
filter is washed in TBS/0.3% Tween 20 and TBS/0.1% Tween 20. Binding of the antisera is 
detected with goat anti-rabbit IgG conjugated to horseradish peroxidase. After additional 
washing, the localized peroxidase is detected with Amersham enchanced chemiluminescence 
reagents and x-ray film. 

The mlk protein may also be expressed as follows: 

The mlk cDNA is ligated into a number of mammalian cell expression vectors. 
For u^nsieni expression in COS cells, the mlk cDNA is cloned into pED or related plasmids that 
place the cDNA under the control of the adenovirus major late promoter with SV40 enhancer 
and origin, and adenovims VA RNA genes. The same vector is used for gene amplification in 
Chinese hamster ovary cells (dhfr-), using the linked DHFR gene to co-amplify mlk expression 
with selection in reiteratively increasing concentrations of methotrexate (Kaufman, Methods in 
EnzymoL, 1990, 185, 537). 

Expression in cell lines such as factor-dependent BaF3, 32D and FDCP-1, and 
in PC12 and C2C12 is obtained with cytomegalovirus immediate early, Rous sarcoma virus long 
terminal repeal, or /J-actin promoter sequences. Examples of such vectors are pcDNA3 
(Invitrogen. CA) and pDR2 (Clontcch, CA). Stable transfectams are selected with G418 or 
hygromycin B, using neo^ or hyg B marker genes incorporated into the expression vectors. 

Expression of the RTK is monitored by Northern Blot analysis or ribonuclease 
protection assay of cellular RNA from transfectams. Antibodies to the murine or human protein 
can also be prepared and used to examine expression, by immunoprecipitation of bios> ntheticall> 
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radiolabelled cell extracts. Western blot analysis of cell extracts, or flow cytomeirx'. 

Example 9 
Identification of mlk Binding Ligands 

A number of techniques can be employed to isolate mlk ligand cDNA. Whether 
the ligand protein is secreted from the cell in a soluble form, or associated with the cell 
membrane by means of a transmembrane domain, GPI-linked anchor or heparin binding, will 
determine which route is most successful. 

Soluble ligands are detected in the conditioned media of cell lines. The 
conditioned media is concentrated by Amicon centriprep units 10-lOOx, and then used to 
stimulate proliferation of the BaF3 or other factor-dependent cell lines stably transfected with 
mlk receptor protein expression vectors. Proliferation can be measured by the incorporation of 
^H-thymidine. Positive signals in this assay are dissected using neutralizing antibodies to known 
growth factors, to ensure the response is due to the novel mlk ligand. Other confirmatory data 
may then be sought. For example, the conditioned media may be used to assay for 
phosphotyrosine induction on the mlk receptor (see assay above). 

In addition to cells transfected with a mlk receptor protein expression vector, other 
chimeric receptors may be used. For example, a strong proliferative signal is induced in cells 
such as BaF3 by hematopoietin receptor family members. The extracellular region of the EGF 
RTK has been fused to the transmembrane and inuracellular region of the erythropoietin receptor 
(EPO-R) (Pacific et al,. J. Biol, Chem., 1994, 269, 1571). In 32D cells, this fusion receptor 
gives a strong proliferation signal to EGF, that is better than that in cells transfected with vectors 
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encoding the complete EGF RTK. A mlk extracellular region/EPO-R transmembrane- 
intracellular region fusion ftinciions similarly in 32D and BaF3 cells, and is used for screening 
conditioned media for the mlk ligand. 

The ligand cDNA from a cell line is isolated by either expression cloning or 
protein purification. A cDNA library is made from poly A + RNA from the cell line. For 
expression cloning, this cDNA is ligated into an efficient COS cell expression vector, such as 
pXM or pED, and is transformed into £. coli DHIOB cells (Gibco-BRL) by electroporation. 
Pools of plasmid transformanis (approximately 100-2000) are plated individually on 100 mM L- 
amp plates. Copies are made on nitrocellulose filters. One copy is soaked in glycerol, and is 
used to cryopreserve the transformants at -80**C. An additional copy is used to make a plasmid 
DNA preparation from the pool. Such plasmids are transfected transiently into COS cells using 
DEAE-dextran, and after 48-72 hr. the COS conditioned media are screened on the bioassay. 
Once a positive pool is identified, the frozen copy is used to isolate the single positive plasmid 
by a process of subdivision and grid formation. 

The ligand can also be purified using the extracellular region of the receptor. The 
mlk extracellular region is fused in frame to the hinge-CH2-CH3 regions of human 
inununoglobulin (Ig) 7I, and is cloned into pEDAC. Using transient expression in COS cells, 
or selection and amplification in stably transfected CHO cells, the soluble mik-lg fusion is 
produced. It is purified from conditioned media using protein A-chromatography, The mlk Is 
fusion is then coupled to resins and used to affinity purify the mlk ligand. Other purifications 
steps such as reverse phase or ion exchange chromatography may also be used. 

The purified ligand protein is analyzed for amino acid sequence, and the resulting 
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data is used to design degenerate oligonucleotide primers. These primers are used to screen a 
cDNA library from the cell line source, either by hybridization in the presence of tetramethyl 
ammonium chloride, or by PCR. 

Membrane-associated mik ligands are isolated by a different procedure. The mlk- 
Ig fusion is used to screen for cell lines expressing the mlk ligand. The mlk-Ig fusion is either 
per-complexed with protein A-FITC, or bound to the cell surface and then detected in a second 
staining with goat F^^ ,, anti-human Ig-HTC (Southern Biotechnology Associates. Alabama). 
Binding is detected by flow cytometry. A cDNA library from the cell line source is prepared 
in the pXM or pEDAC vectors for expression in COS cells. The ligand cDNA can then be 
isolated by COS transfection, staining with mik-lg: protein A-FITC pre-complexed material, and 
soning the brightest population of transfccted COS cells, e.g.. with a Becton-Dickinson FACS 
Star. Plasmid DNA is rescued from this brightest population of COS cells by the Hirt procedure 
(Hirt, J. Mol. BioL, 1967, 26, 365), or by an alkaline-SDS mmi-plasmid preparation combined 
with a phenol/chloroform extraction. Plasmid DNA is transformed into E. coli DHIOB, a large 
scale plasmid isolation is then carried out and re-transfected into COS cells. By re-iteratively 
following this selection procedure, the mlk ligand cDNA is enriched (Yamasahi et aL. Science, 
1988, 241, 825; Rice et al., Cytometr)', 1991, 12, 221). After 3-5 rounds of selection, 
individual plasmids are analyzed by transfection and mlk-Ig binding. Alternatively, the 
transfected COS cells are enriched for the mlk ligand by "panning" (Seed et al.. Proc. Nail. 
Acad. Sci. USA, 1987, 84, 3365; Amffo et al., Proc. Natl. Acad. Sci. USA, 1987. 84. 8573). 
or by magnetic particle separation (Padmanabhan et al. Analytical Biochem., 1988. 170, 341). 

Expression libraries are also transfected into COS cells. The subpopulation of 
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cells expressing the membrane-bound ralk ligand is identified by staining with mlk-Ig and rabbit 
anti-human Ig conjugated to alkaline phsophatase and visualized with an insoluble alkaline 
phosphatase substrate, such as NBT/BCIP (Promega, WI). The positive cell population is 
scraped from the plate, plasmid is recovered and re-transfected in E. coli . and then transfected 
into COS cells for rescreening. In this way. single cDNAs expressing the mlk ligand can be 
isolated. This in situ staining technique can also be used to identify pools of clones containing 
the ligand cDNA. and then the specific clone isolated by reiterative pertition and screening of 
the pool. By permeabilization of the cell membrane, the mlk-Ig can be used to stain for 
intracellular mlk ligand in the secretory pathway. 

All patent and literamre references cited herein are incorporated by reference as 
if fiilly set forth. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Wood, Clive 

Caruso, Anthony 

(ii) TITLE OF INVENTION: Novel mlk Receptor Tyrosine Kinases 
(iii) NUMBER OF SEQUENCES: 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LEGAL AFFAIRS 

(B) STREET: 87 Cambridge Park Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) 2IP: 02140 

(V) COMPUTER READABLE FORM - 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

JC) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Brown, Scott A 

(B) REGISTRATION NUMBER: 32,724 

(C) REFERENCE/DOCKET NUMBER: GI5234A 

(ix) TELECOMMUNICATION INFORMATION- 

(A) TELEPHONE: (617) 498-8224 

(B) TELEFAX: (617) 876-5851 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 1602 
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(Xi) SEQUENCE DESCRIPTION: SEQ IDN0:1: 

GAATTCAAGC CG AAA GAT GCT CTT GTC TTC TTC AAC ACC TCC TAC CGG 4B 
Lys Asp Ala Leu Val Phe Phe Asn Thr Ser Tyr Arg 
I 5 10 

GAC CCC GAG GAC GCC CAG GAG CTG CTG ATC CAC ACT GCG TGG AAT GAG 96 
Asp Pro Glu Asp Ala Gin Glu Leu Leu He His Thr Ala Trp Asn Glu 
15 20 25 

CTG AAG GCT GTG AGT CCA CTG TGC CGG CCA GCT GCT GAG GCT CTG CTG 144 
Leu Lys Ala Val Ser Pro Leu Cys Arg Pro Ala Ala Glu Ala Leu Leu 
30 35 40 

TGT AAC CAC CTC TTC CAA GAG TGC AGC CCT GGA GTG GTA CCT ACT CCC 192 
Cys Asn His Leu Phe Gin Glu Cys Ser Pro Gly Val Val Pro Thr Pro 
45 50 55 60 

ATG CCC ATT TGC AGA GAG TAC TGC CTG GCG GTA AAG GAG CTC TTC TGT 24 0 

Met Pro He Cys Arg Glu Tyr Cys Leu Ala Val Lys Glu Leu Phe Cys 
65 70 75 

GCA AAG GAA TGG CAG GCA ATG GAA GGA AAG GCC CAC CGG GGC CTC TAC 2 88 

Ala Lys Glu Trp Gin Ala Met Glu Gly Lys Ala His Arg Gly Leu Tyr 
80 85 90 

AGA TCT GGG ATG CAT CTC CTT CCG GTA CCA GAG TGC AGC AAG CTT CCC 336 
Arg Ser Gly Met His Leu Leu Pro Val Pro Glu Cys Ser Lys Leu Pro 
95 100 105 

AGC ATG CAC CGG GAC CCC ACA GCC TGC ACA AGA CTG CCA TAT TTA GCA 384 
Ser Met His Arg Asp Pro Thr Ala Cys Thr Arg Leu Pro Tyr Leu Ala 
110 115 120 

TTC CCG TCA ATA ACG TCC TCC AGG CCG AGC GCG GAC ATT CCA AAC CTG 432 
Phe Pro Ser He Thr Ser Ser Arg Pro Ser Ala Asp He Pro Asn Leu 
"5 130 135 140 

CCT GCC TCC ACC TCT TCC TTT GCC GTC TCG CCT GCG TAC TCC ATG ACC 460 
Pro Ala Ser Thr Ser Ser Phe Ala Val Ser Pro Ala Tyr Ser Met Thr 
145 ISO 155 

GTC ATC ATC TCC ATC GTG TCC AGC TTT GCC CTG TTT GCT CTT CTC ACC 528 
Val He He Ser He Val Ser Ser Phe Ala Leu Phe Ala Leu Leu Thr 
160 165 170 

ATC GCT ACT CTC TAT TGC TGC CGA AGG AGG AAA GAA TGG AAA AAT AAG 576 
He Ala Thr Leu Tyr Cys Cys Arg Arg Arg Lys Glu Trp Lys Asn Lys 
175 180 185 

AAA AGA GAG TCG ACC GCG GTG ACC CTC ACC ACG TT6 CCT TCC GAG CTC 624 
Lys Arg Glu Ser Thr Ala Val Thr Leu Thr Thr Leu Pro Ser Glu Leu 
190 195 200 

CTG CTG GAT AGG CTC CAT CCC AAC CCC ATG TAC CAG AGG ATG CCA CTC 672 
Leu Leu Asp Arg Leu His Pro Asn Pro Met Tyr Gin Arg Met Pro Leu 
205 210 215 220 

CTT CTG AAT CCT AAG TTG CTC AGC CTG GAG TAT CCG AGG AAT AAC ATT 720 
Leu Leu Asn Pro Lys I^eu Leu Ser Leu Glu Tyr Pro Arg Asn Asn He 
225 230 235 
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GAG TAT GTC CGA GAC ATC GGA GAG GGG GCG TTT GGA AGA GTC TTC CAA 768 
Glu Tyr val Arg Asp lie Gly Glu Gly Ala Phe Gly Arg Val Phe Gin 
240 245 250 

GCA AGG GCC CCT GGC TTG CTG CCT TAT GAA CGT TTC ACT ATG GTG GCC 816 
Ala Arg Ala Pro Gly Leu Leu Pro Tyr Glu Pro Phe Thr Met Val Ala 
255 260 265 

GTG AAG ATG CTT AAG GAA GAG GCC TCT GCA GAC ATG CAA GCG GAC TTT 864 
Val Lys Met Leu Lys Glu Glu Ala Ser Ala Asp Met Gin Ala Asp Phe 
270 275 280 

CAG AGG GAG GCG GCC CTC ATG GCA GAG TTT GAC AAC CCC AAC ATC GTG 912 
Gin Arg Glu Ala Ala Leu Met Ala Glu Phe Asp Asn Pro Asn lie Val 
285 290 295 300 

AAA CTC TTA GGT GTG TGT GCC GTT GGG AAG CCG ATG TGT CTG CTC TTT 960 
Lys Leu Leu Gly Val Cys Ala Val Gly Lys Pro Met Cys Leu Leu Phe 
305 310 315 

GAA TAT ATG GCC TAT GGT GAC CTC AAT GAG TTC CTC CGA AGT ATG TCC 1008 
Glu Tyr Met Ala Tyr Gly Asp Leu Asn Glu Phe Leu Arg Ser Met Ser 
320 325 330 

CCG CAC ACT GTT TGC AGC CTC AGC CAC AGT GAC CTG TCC ACG AGG GCT 1056 
Pro His Thr Val Cys Ser Leu Ser His Ser Asp Leu Ser Thr Arg Ala 
335 340 345 

CGG GTG TCT AGC CCT GGT CCT CCA CCA CTG TCC TGT GCA GAA CAG CTC 1104 
Arg Val Ser Ser Pro Gly Pro Pro Pro Leu Ser Cys Ala Glu Gin Leu 
350 355 360 

TGC ATT GCC AGG CAG GTG GCA GCT GGC ATG GCC TAG CTT TCA GAG CGC 1152 
Cys He Ala Arg Gin Val Ala Ala Gly Met Ala Tyr Leu Ser Glu Arg 
365 370 375 380 

AAG TTT GTC CAC CGG GAC TTA GCT ACC AGG AAC TGC CTG GTT GGG GAG 1200 
Lys Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu 
385 390 395 

ACC ATG GTG GTG AAA ATT GCA GAC TTT GGC CTC TCC AGG AAC ATC TAT 1248 
Thr Met Val Val Lys He Ala Asp Phe Gly Leu Ser Arg Asn He Tyr 
400 405 410 

TCC GCA GAC TAC TAC AAA GCT GAT GGA AAT GAC GCC ATC CCT ATC CGC 1296 
Ser Ala Asp Tyr Tyr Lys Ala Asp Gly Asn Asp Ala lie Pro He Arg 
415 420 425 

TGG ATG CCG CCC GAG TCT ATC TTC TAC AAC CGC TAC ACC ACG GAG TCG 1344 
Trp Met Pro Pro Glu Ser He Phe Tyr Asn Arg Tyr Thr Thr Glu Ser 
430 435 440 

GAT GTA TGG GCC TAT GGT GTG GTC CTC TGG GAG ATC TTC TCC TAT GGG 1392 
Asp Val Trp Ala Tyr Gly Val Val Leu Trp Glu He Phe Ser Tyr Gly 
445 450 455 460 

CTG CAG CCC TAC TAT GGA ATG GCC CAC GAG GAG GTC ATT TAC TAT GTG 144 0 

Leu Gin Pro Tyr Tyr Gly Met Ala His Glu Glu Val He Tyr Tyr Val 
465 470 475 
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AGA GAT GGC AAC ATC CTC GCC TGC CCT GAG AAC TGC CCC TTG GAA CTG 1488 
Arg Asp Gly Asn He Leu Ala Cys Pro Glu Asn Cys Pro Leu Glu Leu 
4B0 485 490 

TAC AAC CTC ATG CGC CTG TGT TGG AGO AAG CTG CCT GCT GAT AGA CCC 15 36 

Tyr Asn Leu Met Arg Leu Cys Trp Ser Lys Leu Pro Ala Asp Arg Pro 
495 500 505 

AGC TTC TGC AGT ATC CAC AGO ATC CTG CAG CGC ATG TGC GAG AGA GCA 1584 
Ser Phe Cys Ser He His Arg He Leu Gin Arg Met Cys Glu Arg Ala 
510 515 520 

GAG GGA ACG GTG GGT GTC TAAAGTTGAC CATTCTCAAA CAACACCCAG 16 32 
Glu Gly Thr Val Gly Val 
525 530 

GAGGCTCTTT TCAGACTGTG AGCTGGAGGA ACCCTACCGC AGAGGCCGTG TAAGATCAGA 1692 

TAGGAGGAGT TTAACTCAGA CATCACGTGC CAGTTGATTG TTTGCCAGGA GAAACAGATG 1752 

GTGAATATCC CACAGGGTTA AAGAGTCACA TCGAAATAGG TTGGAGATAC AGGCTAGGAA 1812 

AGAGGACAGC AGGTAGCTCC TCTCCCTCAC AGGGACCGTT TCTAATATAT ATTGCATAAT 1872 

AAGAACATCC CGGTTACGTT CCTACATAAT CCCTCAGAGC GAGATTGCAG TGCTTAGGTT 1932 

GAATCCAAAA CTGGATGGGC AACTTCATTT TTAACAGAAG ACATCCTGCC CATTGCAAAA 1992 

GCAATGTGTC TTTGTGTATA TTTAGGTAAA GGACTQAAAA CTAAAQATAG GAATCCCTTC 2052 

TTCCACCAGT CAAGACACGT GGCAGGGTCT TGCTGTTGTT TAGTTCTTCC TTGCACAGAA 2112 

TATGTAACGT TGTATTTGCA TTCTGGAATT GAGTATCTAT TTTACTGATA GACTTTTGAA 2172 

GAATAAAAAG TGGAAAGCCT GCAAAAAAAA AA6CTT 2206 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ IDNO:2: 

Lys Asp Ala Leu Val Phe Phe Asn Thr Ser Tyr Arg Asp Pro Glu Asp 
15 10 15 

Ala Gin Glu Leu Leu He His Thr Ala Trp Asn Glu Leu Lys Ala Val 
20 25 30 

Ser Pro Leu Cys Arg Pro Ala Ala Glu Ala X/eu Leu Cys Asn His Leu 
35 40 45 

Phe Gin Glu Cys Ser Pro Gly Val Val Pro Thr Pro Met Pro He Cys 
50 55 60 

Arg Glu Tyr Cys Leu Ala Val Lys Glu Leu Phe Cys Ala Lys Glu Trp 
65 70 75 80 
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Gin Ala Met Glu Gly hys Ala His Arg Gly Leu Tyr Arg Ser Gly Met 
85 90 95 

His Leu Leu Pro Val Pro Glu Cys Ser Lys Leu Pro Ser Met His Arg 
100 105 110 

Asp Pro Thr Ala Cys Thr Arg Leu Pro Tyr Leu Ala Phe Pro Ser lie 
115 120 125 

Thr Ser Ser Arg Pro Ser Ala Asp lie Pro Asn Leu Pro Ala Ser Thr 
130 135 140 

Ser Ser Phe Ala Val Ser Pro Ala Tyr Ser Met Thr Val He He Ser 
145 150 155 160 

He Val Ser Ser Phe Ala Leu Phe Ala Leu Leu Thr He Ala Thr Leu 
165 170 175 

Tyr Cys Cys Arg Arg Arg Lys Glu Trp Lys Asn Lys Lys Arg Glu Ser 
180 IBS 190 

Thr Ala Val Thr Leu Thr Thr Leu Pro Ser Glu Leu Leu Leu Asp Arg 
195 200 205 

Leu His Pro Asn Pro Met Tyr Gin Arg Met Pro Leu Leu Leu Asn Pro 
210 215 220 

Lys Leu Leu Ser Leu Glu Tyr Pro Arg Asn Asn He Glu Tyr Val Arg 
225 230 235 240 

Asp He Gly Glu Gly Ala Phe Gly Arg Val Phe Gin Ala Arg Ala Pro 



Gly Leu Leu Pro Tyr Glu Pro Phe Thr Met Val Ala Val Lys Met Leu 
260 265 270 

Lys Glu Glu Ala Ser Ala Asp Met Gin Ala Asp Phe Gin Arg Glu Ala 
275 280 285 

Ala Leu Met Ala Glu Phe Asp Asn Pro Asn He Val Lys Leu Leu Gly 
290 295 300 

Val Cys Ala Val Gly Lys Pro Met Cys Leu Leu Phe Glu Tyr Met Ala 
305 310 315 320 

Tyr Gly Asp Leu Asn Glu Phe Leu Arg Ser Met Ser Pro His Thr Val 
325 330 335 

Cys Ser Leu Ser His Ser Asp Leu Ser Thr Arg Ala Arg Val Ser Ser 
340 345 350 

Pro Gly Pro Pro Pro Leu Ser Cys Ala Glu Gin Leu Cys He Ala Arg 
355 360 365 

Gin Val Ala Ala Gly Met Ala Tyr Leu Ser Glu Arg Lys Phe Val His 
370 375 380 

Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Thr Met Val Val 
385 390 395 400 

Lys He Ala Asp Phe Gly Leu Ser Arg Asn He Tyr Ser Ala Asp Tyr 



245 



250 



255 



405 



410 



415 
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Tyr Lys Ala Asp Gly Asn Asp Ala lie Pro lie Arg Trp Met Pro Pro 
420 425 430 

Glu Ser He Phe Tyr Asn Arg Tyr Thr Thr Glu Ser Asp Val Trp Ala 
435 440 445 

Tyr Gly Val Val Leu Trp Glu He Phe Ser Tyr Gly Leu Gin Pro Tvr 
450 455 

Tyr Gly Met Ala His Glu Glu Val He Tyr Tyr Val Arg Asp Gly Asn 

470 475 480 

He Leu Ala Cys Pro Glu Asn Cys Pro Leu Glu Leu Tyr Asn Leu Met 
485 490 495 

Arg Leu Cys Trp Ser Lys Leu Pro Ala Asp Arg Pro Ser Phe Cys Ser 
500 505 510 

He His Arg He Leu Gin Arg Met Cys Glu Arg Ala Glu Gly Thr Val 
515 520 525 

Gly Val 
530 

(2) INFORMATION FOR SEQ ID NO: 3: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

Ui) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTTAGGATCC ACMGNGAYYT 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
AGGAATTCCR V.'AGGACCASA CRTC 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
{iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 5 
GATGAATTCG AYYTNGCNGC NMGNAA 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 6 
GTTGAATTCT CNARNGCNAR CCA 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
TACTACAAAG CTGATGGAAA TGACG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CAGACTGTGA GCTGGAGGAC C 
(2 J INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(iil MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9; 
GGATTATGTA GGAACGTAAC C 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 



(xi) SEQtJENCE DESCRIPTION: SEQ ID NO; 10 
CGAAATAGGT TGGAGATACA GG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CGTGTCTTGA CTGGTGGAAG AAGG 2 4 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

<A) NAME/KEY: CDS 

(B) LOCATION: 121.. 2961 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CTTTCGGGAC TGTCAGTGAA TCCAGAGAAG CTAACATCTA TGAACTCTGA TTTGAATACA 60 

GGATTCAAGC GTGTACTGGC CTGTTTGGCA GAAAAATATC ATTTCTGATC GACGATCATC 120 

ATG AAC TTT ATC CCA GTC GAC ATT CCA CTC TTG ATG ATC TTC CTT GTG 168 
Met Asn Phe lie Pro Val Asp lie Pro Leu Leu Met lie Phe Leu Val 
15 10 -15 

ACA ACT GGG GGC TCA OCT GAC GGA ATC CTT CCC AAA GCT CCA CAG ATC 216 
Thr Thr Gly Gly Ser Ala Asp Gly lie Leu Pro Lys Ala Pro Gin lie 
20 25 30 

ACC AGT CCC TTG GAG ACA GTG GAT GCC TTG GTT GAG GAA GAA GCT TCT 264 
Thr Ser Pro Leu Glu Thr Val Asp Ala Leu Val Glu Glu Glu Ala Ser 
35 40 45 

TTC ATG TGT GCA GTG GAT TCA TAC CCA GCG GCA GAG ATT ACC TGG ACC 312 
Phe Met Cys Ala Val Asp Ser Tyr Pro Ala Ala Glu lie Thr Trp Thr 
50 55 60 

CGC AAT AAC ATT CCC ATA AGA CCC TTT GAC ACT CGC TAC AGT ACA AAA 360 
Arg Asn Asn lie Pro He Arg Pro Phe Asp Thr Arg Tyr Ser Thr Lys 
65 70 75 80 

GAA AAT GGC CAG ATA TTA ACC ATC CTC AGC GTT GAA GAC ACA GAC AAT 4 08 

Glu Asn Gly Gin He Leu Thr He Leu Ser Val Glu Asp Thr Asp Asn 
85 90 95 

GGG GTG TAC TGC TGC ACC GCC AAC AAC GGC ATG GGG AGC TCT GCT CAA 4 56 

Gly Val Tyr Cys Cys Thr Ala Asn Asn Gly Met Gly Ser Ser Ala Gin 
100 105 110 

AGC TGT GGT GCC CTC CAG GTC AAA ATG AAG CCA AAG ATC ATT CGG CCA 504 
Ser Cys Gly Ala Leu Gin Val Lys Met Lys Pro Lys He He Arg Pro 
115 120 125 
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CCC ACT GAT GTC AGA GCA CTG CTG GGA TCG AAG GTT GTG TTA CCT TGC 552 
Pro Thr Asp Val Arg Ala Leu Leu Gly Ser Lys Val val Leu Pro Cys 
130 135 140 

AGT ACC ATG GGG AAT CCA AAA CCA GCC ATT TCA TGG TTC AAA GAT GAA 6 00 

Ser Thr Met Gly Asn Pro Lys Pro Ala lie Ser Trp Phe Lys Asp Glu 
145 150 155 160 

ACT GCA CTG AAA AAT GAC CAG CCT CGA ACT TCT GTC CTT GAG TCT GGG 64 8 

Thr Ala Leu Lys Asn Asp Gin Pro Arg Thr Ser Val Leu Glu Ser Gly 
165 170 175 

AAC TTA AGA ATT CGC AAT GTT CAG CTT GAA GAT OCA GGA AAA TAT CGA 6 96 

Asn Leu Arg lie Arg Asn Val Gin Leu Glu Asp Ala Gly Lys Tyr Arg 
180 185 190 

TGT TTG GCA AGA AAC AGC CTG GGC TTC GAG TAT TCC AGA TCT GCG GCT 744 
Cys Leu Ala Arg Asn Ser Leu Gly Phe Glu Tyr Ser Arg Ser Ala Ala 
195 200 205 

CTG GAA GTG CAG GTC TCT GCC AGA ATT GTG AAG GCG CCC ACA TCA CAA 792 
Leu Glu Val Gin Val Ser Ala Arg He Val Lys Ala Pro Thr Ser Oln 
210 215 220 

AAT GTC AGC TAT GGT TCC GAA GTG ATC TTG CAG TGC AAA GCC ACC GGG 84 0 

Asn Val Ser Tyr Gly Ser Glu Val He Leu Gin Cys Lys Ala Thr Gly 
225 230 235 240 

TTT CCG ATT CCC ACC ATC AAG TGG TTG GAG AAT GGG AGA GCA GTC CCC 886 
Phe Pro He Pro Thr He Lys Trp Leu Glu Asn Gly Arg Ala Val Pro 
245 250 255 

AAG GGT TCG ATA CAG AAT CGC ATC AAG GGA GAG GTG ATG GAA TCT AGG 936 
Lys Gly Ser He Gin Asn Arg He Lys Gly Glu Val Met Glu Ser Arg 
260 265 270 

CTG CGG GTC TAT GTT ACC AGA CCT TCA CTG TTC ACT TGC CTG ACT ACC 964 
Leu Arg Val Tyr Val Thr Arg Pro Ser Leu Phe Thr Cys Leu Thr Thr 
275 280 285 

AAC fiHG CAC AAT GAA GGA AGT ACC ACA GCA AAA GCC ACT GCC ACC CTG 1032 
Asn Lys His Asn Glu Gly Ser Thr Thr Ala Lys Ala Thr Ala Thr Leu 
290 295 300 

GAT ATC AAA GAA TGG AGA TTG TAC AAA GGT GAC TTG GGC TAT TGC AGC 1080 
Asp He Lys Glu Trp Arg Leu Tyr Lys Gly Asp Leu Gly Tyr Cys Ser 
305 310 315 320 

ACA TAT CGT GGT GAG GTA TGC CAA GGT CTT CTG GGA AAT GGC CAG CTG 11^8 
Thr Tyr Arg Gly Glu Val Cys Gin Gly Leu Leu Gly Asn Gly Gin Leu 
325 330 335 

GTT TTC TTC AAC TCT TCT TTT GCC GAT GCA GAG GGG ACA CAA GAG ATG 1176 
Val Phe Phe Asn Ser Ser Phe Ala Asp Ala Glu Gly Thr Gin Glu Met 
340 345 350 

ATG GCC AGG AGC ACA TGG ACG GAG TTG GAT GGC GTC AGC TTG CTG TGC 1224 
Met Ala Arg Ser Thr Trp Thr Glu Leu Asp Gly Val Ser Leu Leu Cys 
355 360 365 
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AAA CCA GOT GCC GAG TCC CTA CTC TGC CAC TTC ATT TTC CAA GAC TGT 12 7 2 

Lys Pro Ala Ala Glu Ser Leu Leu Cys His Phe He Phe Gin Asp Cys 
370 375 380 

AAT CCr TTA GGG CTG GGT CCT ACT CCC AAA CTT GTG TGC CGT GAG CAT 132 0 

Asn Pro Leu Gly Leu Gly Pro Thr Pro Lys Leu Val Cys Arg Glu His 
385 390 395 400 

TGC TTG GCA GTG AAA GAG CTT TAT TGT TAC AAA GAA TGG ATC ACA ATG 1366 
Cys Leu Ala Val Lys Glu Leu Tyr Cys Tyr Lys Glu Trp lie Thr Mec 
405 410 415 

GAG GAC AAT TCA CGC ATA GGA GTT TAC TCT GCG GGT CTG AGC CTA CCA 1416 
Glu Asp Asn Ser Arg lie Gly Val Tyr Ser Ala Gly Leu Ser Leu Pro 
420 425 430 

GAC TGT CAG AGG CTT CCC AGT ATA CAC CAT GAC CCA GAA GCA TGC ACC 1464 
Asp Cys Gin Arg Leu Pro Ser lie His His Asp Pro Glu Ala Cys Thr 
435 440 445 

AGA GTC TCT TTT CTT GAC ATG AAG AAG GGG CTC GTT ACC AG A ATG TGT 1512 
Arg Val Ser Phe Leu Asp Wet Lys Lys Gly Leu Val Thr Arg Met Cys 
450 455 460 

TAC AAC AAT AAC GGG AGG TTT TAC CAG GGA TCG GTG AAT GTC ACT GCA 1560 
Tyr Asn Asn Asn Gly Arg Phe Tyr Gin Gly Ser Val Asn Val Thr Ala 
465 470 475 480 

TCA GGC ATT TCC TGT CAG AGA TGG AGT GAG CAG GCT CCT CAT TTC CAC 1608 
Ser Gly He Ser Cys Gin Arg Trp Ser Glu Gin Ala Pro His Phe His 
485 490 495 

AGG CGT CTG CCA GAG ATA TTT CCT GAA TTA GCC AAT TCT GAC AAC TTC 1656 
Arg Arg Leu Pro Glu He Phe Pro Glu Leu Ala Asn Ser Asp Asn Phe 
500 505 510 

TGC CGG AAC CCA GGA GGT GAG AGT GAA CGA CCG TGG TGT TAT ACG ATG 1704 
Cys Arg Asn Pro Gly Gly Glu Ser Glu Arg Pro Trp Cys Tyr Thr Met 
515 520 525 

GAT CGA GAC ATC CGG TGG GAA TTC TGC AAT GTG CCT CAA TGT ATC AAT 1752 
Asp Arg Asp He Arg Trp Glu Phe Cys Asn Val Pro Gin Cys He Asn 
530 535 540 

GTT TCC TCA ATA TCA GAG ATG AAG CCT AAA ACA GAA ACA GCC AAC ACT 1800 
Val Ser Ser He Ser Glu Met Lys Pro Lys Thr Glu Thr Ala Asn Thr 
545 550 555 560 

CCC AGC ACT TCT GCC ACC TAC TCA ATG ACC GTC ATA ATT TCC ATA ATC 1848 
Pro Ser Thr Ser Ala Thr Tyr Ser Met Thr Val He lie Ser He He 
565 570 575 

TCC AGC CTT GCA GCC TCC ATC CTG TTG ATA ATT ATA ATT CTC ACT TGT 18 96 

Ser Ser Leu Ala Ala Ser He Leu Leu He He He He Leu Thr Cys 
560 585 590 

CAC CAT CAC CAG TlAG GGA TTG CAG ACC AGA AAG AGT TAC AGA ACA ACT 1944 
His His His Gin Lys Gly Leu Gin Thr Arg Lys Ser Tyr Arg Thr Thr 
595 600 605 
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GAG ACC CCT ACT CTG GCT ACT CTT OCT TCA GAG CTG CTT CTA GAG AG A 
Glu Thr Pro Thr Leu Ala Thr Leu Pro Ser Glu Leu Leu Leu Asp Arg 
610 615 620 

CTT CAC CCC AAC CCA ATG TAC CAG CGC CTG CCT CTT CTT CTA AAT GCT 2040 
Leu His Pro Asn Pro Met Tyr Gin Arg Leu Pro Leu Leu Leu Asn Ala 
625 630 635 640 

AAA CTA CTG AGC CTC GAG TAT CCA AGG AAT AAC ATA GAA TAT GTG CGG 20 68 

Lys Leu Leu Ser Leu Glu Tyr Pro Arg Asn Asn lie Glu Tyr Val Arg 
645 650 655 

GAT ATT GGA GAG GGA GCA TTT GGA AGA GTA TTC CAG 6CA AG A GCC CCT 2136 
Asp lie Gly Glu Gly Ala Phe Gly Arg Val Phe Gin Ala Arg Ala Pro 
660 665 670 

CAT CTG CTG CCG CAG GAG ACC TCC ACC ATG GTG GCT GTG AAG ATG CTT 2184 
His Leu Leu Pro Gin Glu Thr Ser Thr Met Val Ala Val Lys Met Leu 
675 680 685 

AAA GAA GAA GCG TCA CCT GAC ATG CAG GCA GAC TTC CGG AGA GAA GCA 2232 
Lys Glu Glu Ala Ser Pro Asp Met Gin Ala Asp Phe Arg Arg Glu Ala 
690 695 700 

GCG CTC ATG GCA GAG TTC AAC CAT CCA AAC ATC GTC AAG CTT TTA GGA 2280 
Ala Leu Met Ala Glu Phe Asn His Pro Asn He Val Lys Leu Leu Glv 
■^05 710 715 .720 

GTG TGC GCT GTT GGA AAG CCG ATG TGC CTG CTA TTC GAG TAC ATG GCG 2328 
Val Cys Ala Val Gly Lys Pro Met Cys Leu Leu Phe Glu Tyr Met Ala 
725 730 735 

CAT GGA GAC CTG AAC GAG TAT TTA CGC AAG CGG TCA CCC ATC ACC GCC 2376 
His Gly Asp Leu Asn Glu Tyr Leu Arg Lys Arg Ser Pro He Thr Ala 
740 745 750 

CGC ACC TTG AGG CCC GCC AAT TGT GTG GGA TGG AGC AGC OGC TGG GGA 2424 
Arg Thr Leu Arg Pro Ala Asn Cys Val Gly Trp Ser Ser Gly Trp Gly 
755 760 765 

AAG GGC CTG ACA GCC CTC AGC TGC GCT GAC CAA CTG AAC ATC GCC AAG 2472 
Lys Gly Leu Thr Ala Leu Ser Cys Ala Asp Gin Leu Asn He Ala Lys 
770 775 780 

CAG ATC TCA GCG GGC ATG ACC TAC CTG TCG GAG CGC AAG TTT GTT CAC 2520 
Gin He Ser Ala Gly Met Thr Tyr Leu Ser Glu Arg Lys Phe Val His 
■^^5 790 795 800 

CGG GAC CTG GCC ACC CGT AAC TGC TTG GTT GGA GAG AAG CTG GTA GTT 25*8 
Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Lys X^u Val Val 
805 810 815 

AAG ATT GCT GAC TTT GGC CTC TCC AGG AAC ATC TAC TCT GCG GAC TAT 2616 
Lys He Ala Asp Phe Gly Leu Ser Arg Asn He Tyr Ser Ala Asp Tyr 
820 825 830 

TAC AAG GCC AAT GAG AAT GAT GCC ATC CCG ATC AGG TGG ATG CCT CCT 2664 
Tyr Lys Ala Asn Glu Asn Aep Ala He Pro He Arg Trp Met Pro Pro 
835 840 845 
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GAA TCC ATA TTC TTC AAC CGT TAT ACC ACC GAG TCC GAC GTC TGG GCT 2 712 

Glu Ser lie Phe Phe Asn Arg Tyr Thr Thr Glu Ser Asp Val Trp Ala 
850 855 860 

TAT GGT GTG GTC CTG TGG GAG ATC TTC TCG TCC GGC ATG CAG CCA TAC 2 76 0 

Tyr Gly Val Val Leu Trp Glu lie Phe Ser Ser Gly Met Gin Pro Tyr 
865 870 875 880 

TAT GGG ATG GCC CAC GAA GAG GTG ATC TAC TAT GTT CGA GAC GGG AAC 2808 
Tyr Gly Met Ala His Glu Glu Val lie Tyr Tyr Val Arg Asp Gly Asn 
885 890 895 

ATC CTG TCC TGC CCC GAG AAC TGC CCA CCA GAG CTG TAC AAC TTG ATG 2 856 

lie Leu Ser Cys Pro Glu Asn Cys Pro Pro Glu Leu Tyr Asn Leu Met 
900 905 910 

CGC CTC TGC TGG AGT AAC ATG CCA TCA GAC AGA CCG ACG TTC GCC AGT 2 904 

Arg Leu Cys Trp Ser Asn Met Pro Ser Asp Arg Pro Thr Phe Ala Ser 
915 920 925 

ATC CAT CGC ATC CTG GAG CGC ATG CAC CAG AGG ATG GCA GCC GCA CTC 2 952 

He His Arg He Leu Glu Arg Met His Gin Arg Met Ala Ala Ala Leu 
930 935 940 

CCA GTC TGATCCCCCC TCCCCCCTCT GTACCTTGGG GTACATGTTC CCTGTGCAAG 3 008 

Pro Val 

945 



ATTGGCAAGG 


GTTATGGCCG 


GTTGCCTGAC 


CGTGCCTGAC 


TTCGGATGCC 


AGGAGCACAC 


3068 


ACCACGCCAA 


CCAGGCCCTT 


AAATACTCTG 


ACCCCCACAA 


TCTTCATTTC 


ACAAACTAGC 


3128 


CATGCCCAGT 


ACATTGAAAC 


ACCCAATATT 


AAACCGTAGG 


TTCCAATTAC 


CATCCCTCCC 


3188 


ATTTTTTGTT 


CATGAACACG 


TTTCTCAATT 


ATTACTQGAT 


GTTCATTTAA 


TTATATATGA 


3248 


TGACATTTTA 


TQAGGGGCTT 


CAGTAAAGAA 


GACAGGGAGA 


CTGTCACCAT 


6GTAGAAGGC 


3308 


TCGTAACCAA 


AG6CCCCTAA 


TTTAAAAAGA 


AAAGTCAGGG 




TTTGTGGATT 


3368 


GTTCCAATCA 


ACTTGAAAAA 


GAAGGAATTC 








3398 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 

Met Asn Phe He Pro Val Asp He Pro Leu Leu Met He Phe Leu Val 
15 10 15 

Thr Thr Gly Gly Ser Ala Asp Gly He Leu Pro Lys Ala Pro Gin He 
20 25 30 

Thr Ser Pro Leu Glu Thr Val Asp Ala Leu Val Glu Glu Glu Ala Ser 
35 40 45 
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Phe Met Cys Ala Val Asp Ser Tyr Pro Ala Ala Glu lie Thr Trp Thr 
50 55 60 

Arg Asn Asn lie Pro lie Arg Pro Phe Asp Thr Arg Tyr Ser Thr Lys 
65 70 75 80 

Glu Asn Gly Gin He Leu Thr He Leu Ser Val Glu Asp Thr Asp Asn 
85 90 95 

Gly Val Tyr Cys Cys Thr Ala Asn Asn Gly Met Gly Ser Ser Ala Gin 
100 105 110 

Ser Cys Gly Ala Leu Gin Val Lys Met Lys Pro Lys lie He Arg Pro 
115 120 125 

Pro Thr Asp Val Arg Ala Leu Leu Gly Ser Lys Val Val Leu Pro Cys 
130 135 140 

Ser Thr Met Gly Asn Pro Lys Pro Ala He Ser Trp Phe Lys Asp Glu 
145 150 155 160 

Thr Ala Leu Lys Asn Asp Gin Pro Arg Thr Ser Val Leu Glu Ser Gly 
165 170 175 

Asn Leu Arg He Arg Asn Val Gin Leu Glu Asp Ala Gly Lys Tyr Arg 
180 185 190 

Cys Leu Ala Arg Asn Ser Leu Gly Phe Glu Tyr Ser Arg Ser Ala Ala 
195 200 205 

Leu Glu Val Gin Val Ser Ala Arg He Val Lys Ala Pro Thr Ser Glh 
210 215 220 

Asn Val Ser Tyr Gly Ser Glu Val He Leu Gin Cys Lys Ala Thr Gly 
225 230 235 240 

Phe Pro He Pro Thr He Lys Trp Leu Glu Asn Gly Arg Ala Val Pro 
245 250 255 

Lys Gly Ser He Gin Asn Arg He Lys Gly Glu Val Met Glu Ser Arg 
260 265 270 

Leu Arg Val Tyr Val Thr Arg Pro Ser Leu Phe Thr Cys Leu Thr Thr 
275 280 265 

Asn Lys His Asn Glu Gly Ser Thr Thr Ala Lys Ala Thr Ala Thr Leu 
290 295 300 

Asp He Lys Glu Trp Arg Leu Tyr Lys Gly Asp Leu Gly Tyr Cys Ser 
305 310 315 320 

Thr Tyr Arg Gly Glu Val Cys Gin Gly Leu Leu Gly Asn Gly Gin Leu 
325 330 335 

Val Phe Phe Asn Ser Ser Phe Ala Asp Ala Glu Gly Thr Gin Glu Met 
340 345 350 

Met Ala Arg Ser Thr Trp Thr Glu Leu Asp Gly Val Ser Leu Leu Cys 
355 360 365 

Lys Pro Ala Ala Glu Ser Leu Leu Cys His Phe He Phe Gin Asp Cys 
370 375 380 
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Asn Pro Leu Gly Leu Gly Pro Thr Pro Lys Leu Val Cys Arg Glu His 
3B5 390 395 400 

Cys Leu Ala Val Lys Glu Leu Tyr Cys Tyr Lys Glu Trp lie Thr Met 
405 410 ' 415 

Glu Asp Asn Ser Arg lie Gly Val Tyr Ser Ala Gly Leu Ser Leu Pro 
420 425 430 

Asp Cys Gin Arg Leu Pro Ser He His His Asp Pro Glu Ala Cys Thr 
435 440 445 

Arg Val Ser Phe Leu Asp Met Lys Lys Gly Leu Val Thr Arg Met Cys 
450 455 460 

Tyr Asn Asn Asn Gly Arg Phe Tyr Gin Gly Ser Val Asn Val Thr Ala 
465 470 475 480 

Ser Gly lie Ser Cys Gin Arg Trp Ser Glu Gin Ala Pro His Phe His 
485 490 495 

Arg Arg Leu Pro Glu He Phe Pro Glu Leu Ala Asn Ser Asp Asn Phe 
500 505 510 

Cys Arg Asn Pro Gly Gly Glu Ser Glu Arg Pro Trp Cys Tyr Thr Met 
515 520 525 

Asp Arg Asp He Arg Trp Glu Phe Cys Asn Val Pro Gin Cys He Asn 
530 535 540 

Val Ser Ser He Ser Glu Met Lys Pro Lys Thr Glu Thr Ala Asn Thr 
545 550 555 560 

Pro Ser Thr Ser Ala Thr Tyr Ser Met Thr Val He He Ser He He 
565 570 575 

Ser Ser Leu Ala Ala Ser He Leu Leu He He He He Leu Thr Cys 
580 585 590 

His His His Gin Lys Gly Leu Gin Thr Arg Lys Ser Tyr Arg Thr Thr 
595 600 605 

Glu Thr Pro Thr Leu Ala Thr Leu Pro Ser Glu Leu Leu Leu Asp Arg 
610 615 620 

Leu His Pro Asn Pro Met Tyr Gin Arg Leu Pro Leu Leu Leu Asn Ala 
€25 630 635 640 

Lys Leu Leu Ser Leu Glu Tyr Pro Arg Asn Asn He Glu Tyr Val Arg 
645 650 655 

Asp He Gly Glu Gly Ala Phe Gly Arg Val Phe Gin Ala Arg Ala Pro 
660 665 670 

His Leu Leu Pro Gin Glu Thr Ser Thr Met Val Ala Val Lys Met Leu 
675 680 685 

Lys Glu Glu Ala Ser Pro Asp Met Gin Ala Asp Phe Arg Arg Glu Ala 
690 695 700 

Ala Leu Met Ala Glu Phe Asn His Pro Asn He Val Lys Leu Leu Gly 
705 710 715 720 



54 



W 96/02644 



PCTAJS95/0M93 



Val Cys Ala Val Gly Lys Pro Met Cys Leu Leu Phe Glu Tyr Met Ala 
725 730 735 

His Gly Asp Leu Asn Glu Tyr Leu Arg Lys Arg Ser Pro lie Thr Ala 
740 74 5 75 G 

Arg Thr Leu Arg Pro Ala Asn Cys Val Gly Trp Ser Ser Gly Trp Gly 
755 760 765 

Lys Gly Leu Thr Ala Leu Ser Cys Ala Asp Gin Leu Asn lie Ala Lys 
770 775 780 

Gin lie Ser Ala Gly Met Thr Tyr Leu Ser Glu Arg Lys Phe Val His 
785 790 795 800 

Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Lys Leu Val Val 
805 8X0 815 

Lys lie Ala Asp Phe Gly Leu Ser Arg Asn lie Tyr Ser Ala Asp Tyr 
820 825 830 

Tyr Lys Ala Asn Glu Asn Asp Ala lie Pro lie Arg Trp Met Pro Pro 
835 840 845 

Glu Ser lie Phe Phe Asn Arg Tyr Thr Thr Glu Ser Asp Val Trp Ala 
850 855 860 

Tyr Gly Val Val lieu Trp Glu lie Phe Ser Ser Gly Met Gin Pro Tyr 
865 870 875 880 

Tyr Gly Met Ala His Glu Glu Val lie Tyr Tyr Val Arg Asp Gly Asn 
865 890 895 

He Leu Ser Cye Pro Glu Asn Cys Pro Pro Glu Leu Tyr Asn I*eu Met 
900 905 910 

Arg Leu Cys Trp Ser Asn Met Pro Ser Asp Arg Pro Thr Phe Ala Ser 
915 920 925 

He His Arg lie Leu Glu Arg Met His Gin Arg Met Ala Ala Ala Leu 
930 935 940 

Pro Val 
945 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 146.. 1579 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14; 

CAGACACAAA CAGTCCTTAG CGGACAACTC TATTGTAACA AACCATGCTT TAAAATGTAA €0 

ACCCGGGAGC GTGTTTTTTG tTTTTTTTTT TTTTTTTTCC TCACGTTGTC CAGAAGCAAC 12 0 

CTTTCTTCCT GAGCCTGGAT TAATC ATG AGA GAG CTT GTC AAC ATT CCA CTG 172 

Met Arg Glu Leu Val Asn lie Pro Leu 
1 5 

TTA CAG ATG CTC ACC CTG GTT GCC TTC AGC GGG ACT GAG AAA CTT CCA 22 0 

Leu Gin Met Leu Thr Leu Val Ala Phe Ser Gly Thr Glu Lys Leu Pro 
10 15 20 25 

AAA GCC CCT GTC ATC ACC ACG CCT CTT GAA ACT GTA GAT GCC TTG GTT 268 
Lys Ala Pro Val He Thr Thr Pro Leu Glu Thr Val Asp Ala Leu Val 
30 35 40 

GAA GAA GTA GCG ACT TTC ATG TGT GCC GTG GAA TCC TAC CCT CAG CCC 316 
Glu Glu Val Ala Thr Phe Met Cys Ala Val Glu Ser Tyr Pro Gin Pro 
45 50 55 

GAG ATT TCT TGG ACC AGA AAT AAA ATT CTC ATT AAG CTG TTT GAC ACC 364 
Glu He Ser Trp Thr Arg Asn Lys He Leu He Lys Leu Phe Asp Thr 
60 65 70 

CGC TAC AGC ATC CGG GAG AAT GGT CAG CTC CTC ACC ATT CTG AGC GTG 412 
Arg Tyr Ser He Arg Glu Asn Gly Gin Leu Leu Thr He Leu Ser Val 
75 80 65 

GAA GAC AGT GAT GAT GGC ATC TAC TGC TGC ATA GCC AAC AAT GGA GTG 460 
Glu Asp Ser Asp Asp Gly He Tyr Cys Cys He Ala Asn Asn Gly Val 
90 95 100 105 

GGA GGA GCC GTG GAG AGT TGT GGT GCC CTG CAA GTG AAG ATG AAA CCT 508 
Gly Gly Ala Val Glu Ser Cys Gly Ala Leu Gin Val Lys Met Lys Pro 
110 115 120 

AAA ATA ACT CGT CCT CCC ATT AAT GTA AAA ATA ATA GAG GGA TTG AAG 556 
Lys He Thr Arg Pro Pro He Asn Val Lys He He Glu Gly I^u Lys 
125 130 135 

GCA GTT CTG CCG TGC ACT ACG ATG GGT AAC CCC AAA CCA TCT GTG TCC 604 
Ala Val Leu Pro Cys Thr Thr Met Gly Asn Pro Lys Pro Ser Val Ser 
140 145 150 

TGG ATC AAG GGG GAC AAT GCT CTC AGG GAA AAT TCC AGA ATC GCA GTT 652 
Trp He Lys Gly Asp Asn Ala Leu Arg Glu Asn Ser Arg He Ala Val 
155 160 165 

CTT GkA TCT GGG AGC TTA AGG ATC CAT AAT GTG CAA AAG GAA GAT GCA 700 
Leu Glu Ser Gly Ser Leu Arg He His Asn Val Gin Lys Glu Asp Ala 
170 175 100 185 

GGA CAG TAC CGC TGT GTG GCC AAA AAC AGC CTG GGC ACA GCT TAC TCC 74 8 

Glv Gin Tyr Arg Cys Val Ala Lys Asn Ser Leu Gly Thr Ala Tyr Ser 
190 ' 195 200 

AAA CTG GTG AAG CTG GAA GTG GAG GTT TTT GCA AGA ATC CTG CGT GCT 7 96 

Lys Leu Val Lvs Leu Glu Val Glu Val Phe Ala Arg He Leu Arg Ala 
205 210 215 
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CCT GAA TCC CAC AAT GTC ACC TTT GGT TCC TTT GTA ACC CTA CGC TGC 844 
Pro Glu Ser His Asn Val Thr Phe Gly Ser Phe Val Thr Leu Arg Cys 
220 225 230 

ACA GCA ATA GGC ATC CCT GTC CCC ACC ATC AGC TGG ATT GAA AAC GGA 892 
Thr Ala He Gly He Pro Val Pro Thr He Ser Trp He Glu Asn Gly 
235 240 245 

AAT GCT GTT TCT TCA GGT TCC ATT CAA GAG AGT GTG AAA 6AC CGA GTG 940 
Asn Ala Val Ser Ser Gly Ser He Gin Glu Ser Val Lys Asp Arg val 
250 255 260 265 

ATT 6AC TCA AG A CTC CAG CTC TTC ATC ACA AAG CCA GGA CTC TAC ACA 988 
He Asp Ser Arg Leu Gin Leu Phe He Thr Lys Pro Gly Leu Tyr Thr 
270 275 280 

TGC ATA GCT ACC AAT AAG CAC GGA GAA AAG TTC AGT ACC GCA AAG GCT 1036 
Cys He Ala Thr Asn Lys His Gly Glu Lys Phe Ser Thr Ala Lys Ala 
285 290 295 

GCA GCC ACT GTC AGC ATA GCA GAA TGG AGT AAG TCA CAG AAA GAC AGC 1084 
Ala Ala Thr Val Ser He Ala Glu Trp Ser Lys Ser Gin Lys Asp Ser 
300 305 310 

CAA GGC TAC TGT GCC CAG TAC AGA GGG GAG GTG TGT GAT GCA GTC CTG 1132 
Gin Gly Tyr Cys Ala Gin Tyr Arg Gly Glu Val Cys Asp Ala Val Leu 
315 320 325 

GCG AAA GAT GCT CTT GTC TTC TTC AAC ACC TCC TAC CGG GAC CCC GAG laSO 
Ala Lys Asp Ala Leu Val Phe Phe Asn Thr Ser Tyr Arg Asp Pro Glu 
330 335 340 345 

GAC GCC CAG GAG CTG CTG ATC CAC ACT GCG TGG AAT GAG CTG AAG GCT 1228 
Asp Ala Gin Glu Leu Leu He Hie Thr Ala Trp Asn Glu Leu Lys Ala 
350 355 360 

GTG AGT CCA CTG TGC CGG CCA GCT GCT GAG GCT CTG CTG TGT AAC CAC 1276 
Val Ser Pro Leu Cys Arg Pro Ala Ala Glu Ala Leu Leu Cys Asn His 
365 370 375 

CTC TTC CAA GAG TGC AGC CCT GGA GTG GTA CCT ACT CCC ATG CCC ATT 1324 
Leu Phe Gin Glu Cys Ser Pro Gly Val Val Pro Thr Pro Met Pro He 
380 385 390 

TGC AGA GAG TAC TGC CTG GCG GTA AAG GAG CTC TTC TGT GCA AAG GAA 1372 
Cys Arg Glu Tyr Cys Leu Ala Val Lys Glu Leu Phe Cys Ala Lys Glu 
395 400 405 

TGG CAG GCA ATG GAA GGA AAG GCC CAC CGG GGC CTC TAC AGA TCT GGG 1420 
Trp Gin Ala Met Glu Gly Lys Ala His Arg Gly Leu Tyr Arg Ser Gly 
410 415 420 425 

ATG CAT CTC CTT CCG GTA CCA GAG TGC AGC AAG CTT CCC AGC ATG CAC 1468 
Met His Leu Leu Pro Val Pro Glu Cys Ser Lys Leu Pro Ser Met His 
430 435 440 

CGG GAC CCC ACA GCC TGC ACA AGA CTG CCA TAT TTA GAT TAT AAA AAA 1516 
Arg Asp Pro Thr Ala Cys Thr Arg Leu Pro Tyr Leu Asp Tyr Lys Lys 
445 450 455 
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GAA AAC ATA ACA ACA TTC CCG TCA ATA ACG TCC TCC AGG CCG AGC GCG 1664 
Glu Asn He Thr Thr Phe Pro Ser He Thr Ser Ser Arg Pro Ser Ala 
460 465 470 

GAC ATT CCA AAC CTG CC 15B1 
Asp He Pro Asn Leu 
475 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 

Met Arg Glu Leu Val Asn He Pro Leu Leu Gin Met Leu Thr Leu Val 
1 5 . 10 15 

Ala Phe Ser Gly Thr Glu Lys Leu Pro Lys Ala Pro Val He Thr Thr 
20 25 30 

Pro Leu Glu Thr Val Asp Ala Leu Val Glu Glu Val Ala Thr Phe Met 
35 40 45 

Cys Ala Val Glu Ser Tyr Pro Gin Pro Glu He Ser Trp Thr Arg Asn 
50 55 60 . 

Lys He Leu He Lys Leu Phe Asp Thr Arg Tyr Ser He Arg Glu Asn 
65 70 75 80 

Gly Gin Leu Leu Thr He Leu Ser Val Glu Asp Ser Asp Asp Gly He 
85 90 95 

Tyr Cys Cys He Ala Asn Asn Gly Val Gly Gly Ala Val Glu Ser Cys 
100 105 110 

Gly Ala Leu Gin Val Lys Met Lys Pro Lys He Thr Arg Pro Pro He 
115 120 125 

Asn Val Lys He He Glu Gly Leu Lys Ala Val Leu Pro Cys Thr Thr 
130 135 140 

Met Gly Asn Pro Lys Pro Ser Val Ser Trp He Lys Gly Asp Asn Ala 
145 150- 155 160 

Leu Arg Glu Asn Ser Arg He Ala Val Leu Glu Ser Gly Ser Leu Arg 
165 170 175 

He His Asn Val Gin Lys Glu Asp Ala Gly Gin Tyr Arg Cys Val Ala 
180 IBS 190 

Lys Asn Ser Leu Gly Thr Ala Tyr Ser Lys Leu Val Lys Leu Glu Val 
195 200 205 

Glu Val Phe Ala Arg He Leu Arg Ala Pro Glu Ser His Asn Val Thr 
210 215 220 
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Phe Gly Ser Phe Val Thr Leu Arg Cys Thr Ala lie Gly lie Pro Val 
225 230 235 240 

Pro Thr He Ser Trp He Glu Asn Gly Asn Ala Val Ser Ser Gly Ser 
245 250 255 

He Gin Glu Ser Val Lys Asp Arg Val He Asp Ser Arg Leu Gin Leu 
260 265 270 

Phe He Thr Lys Pro Gly Leu Tyr Thr Cys He Ala Thr Asn Lys His 
275 280 285 

Gly Glu Lys Phe Ser Thr Ala Lys Ala Ala Ala Thr Val Ser He Ala 
290 295 300 

Glu Trp Ser Lys Ser Gin Lys Asp Ser Gin Gly Tyr Cys Ala Gin Tyr 
305 310 315 320 

Arg Gly Glu Val Cys Asp Ala Val Leu Ala Lys Asp Ala Leu Val Phe 
325 330 335 

Phe Asn Thr Ser Tyr Arg Asp Pro Glu Asp Ala Gin Glu Leu Leu He 
340 345 350 

His Thr Ala Trp Asn Glu Leu Lys Ala Val Ser Pro Leu Cys Arg Pro 
355 360 365 

Ala Ala Glu Ala Leu Leu Cys Asn His Leu Phe Gin Glu Cys Ser Pro 
370 375 3B0 

Gly Val Val Pro Thr Pro Met Pro He (jys Arg Glu Tyr Cys Leu Ala 
385 390 395 400 

Val Lys Glu Leu Phe Cys Ala Lys Glu Trp Gin Ala Met Glu Gly Lys 
405 410 415 

Ala His Arg Gly Leu Tyr Arg Ser Gly Met His Leu Leu Pro Val Pro 
420 425 430 

Glu Cys Ser Lys Leu Pro Ser Met His Arg Asp Pro Thr Ala Cys Thr 
435 440 445 

Arg Leu Pro Tyr Leu Asp Tyr Lys Lys Glu Asn He Thr Thr Phe Pro 
450 455 460 

Ser He Thr Ser Ser Arg Pro Ser Ala Asp He Pro Asn Leu 
465 470 475 

(2) INFORMATION FOR SBQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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vxi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
ACACTGCGTG GAATGAGCTG A 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
AAATATGGCA GTCTTGTGCA 



(21 INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1,.25B0 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATG AGA GAG CTT GTC AAC ATT CCA CTG TTA CAG ATG CTC ACC CTG GTT 4 6 

Met Arg Glu Leu Val Asn lie Pro Leu Leu Gin Met Leu Thr Leu Val 
15 10 15 

GCC TTC AGC GGG ACT GAG AAA CTT CCA AAA GCC CCT GTC ATC ACC ACG 96 
Ala Phe Ser Gly Thr Glu Lys Leu Pro Lys Ala Pro Val lie Thr Thr 
20 25 30 

CCT CTT GAA ACT GTA GAT GCC TTG GTT GAA GAA GTA GCG ACT TTC ATG 144 
Pro Leu Glu Thr Val Asp Ala Leu Val Glu Glu Val Ala Thr Phe Met 
35 40 45 

TGT GCC GTG GAA TCC TAC CCT CAG CCC GAG ATT TCT TGG ACC AGA AAT 192 
Cys Ala Val Glu Ser Tyr Pro Gin Pro Glu lie Ser Trp Thr Arg Asn 
50 55 60 

AAA ATT CTC ATT AAG CTG TTT GAC ACC CGC TAC AGC ATC CGG GAG AAT 24 0 

Lys lie Leu lie Lys Leu Phe Asp Thr Arg Tyr Ser lie Arg Glu Asn 
65 70 75 BO 



60 



wo 96/02644 



PCT/US95/0S493 



GGT CAG CTC CTC ACC ATT CTG AGC GTG GAA GAC AGT GAT GAT GGC ATC 288 
Gly Gin Leu Leu Thr lie Leu Ser Val Glu Asp Ser Asp Asp Gly lie 
85 90 ' 95 

TAG TGC TGC ATA GCC AAC AAT GGA GTG GGA GGA GGC GTG GAG AGT TGT 3 36 

Tyr Cys Cys lie Ala Asn Asn Gly Val Gly Gly Ala Val Glu Ser Cys 
100 105 110 

GGT GCC CTG CAA GTG AAG ATG AAA CCT AAA ATA ACT CGT CCT CCC ATT 384 
Gly Ala Leu Gin Val Lys Met Lys Pro Lys lie Thr Arg Pro Pro lie 
115 120 125 

AAT GTA AAA ATA ATA GAG GGA TTG AAG GCA GTT CTG CCG TGC ACT ACG 4 32 

Asn Val Lys lie lie Glu Gly Leu Lys Ala Val Leu Pro Cys Thr Thr 
130 135 140 

ATG GGT AAC CCC AAA CCA TCT GTG TCC TGG ATC AAG GGG GAC AAT GCT 4 80 

Met Gly Asn Pro Lys Pro Ser Val Ser Trp lie Lys Gly Asp Asn Ala 
145 150 155 160 

CTC AGG GAA AAT TCC AGA ATC GCA GTT CTT GAA TCT GGG AGC TTA AGG 528 
Leu Arg Glu Asn Ser Arg He Ala Val Leu Glu Ser Gly Ser Leu Arg 
165 170 175 

ATC CAT AAT GTG CAA AAG GAA GAT GCA GGA CAG TAC CGC TGT GTG GCC 576 
He His Asn Val Gin Lys Glu Asp Ala Gly Gin Tyr Arg Cys Val Ala 
180 185 190 

AAA AAC AGC CTG GGC ACA GCT TAC TCC AAA CTG GTG AAG CTG GAA GTG 624 
Lys Asn, Ser Leu Gly Thr Ala Tyr Ser Lys Leu Val Lys Leu Glu Val 
195 200 205 

GAG GTT TTT GCA AGA ATC CTG CGT GCT CCT GAA TCC CAC AAT GTC ACC 672 
Glu Val Phe Ala Arg lie Leu Arg Ala Pro Glu Ser His Asn Val Thr 
210 215 220 

TTT GGT TCC TTT GTA ACC CTA CGC TGC ACA GCA ATA GGC ATC CCT GTC 720 
Phe Gly Ser Phe Val Thr Leu Arg Cys Thr Ala He Gly He Pro Val 
225 230 235 240 

CCC ACC ATC AGC TGG ATT GAA AAC GGA AAT GCT GTT TCT TCA GGT TCC 768 
Pro Thr He Ser Trp He Glu Asn Gly Asn Ala Val Ser Ser Gly Ser 
245 250 255 

ATT CAA GAG AGT GTG AAA GAC CGA GTG ATT GAC TCA AGA CTC CAG CTC 816 
He Gin Glu Ser Val Lys Asp Arg Val He Asp Ser Arg Leu Gin Leu 
260 265 270 

TTC ATC ACA AAG CCA GGA CTC TAC ACA TGC ATA GCT ACC AAT AAG CAC 6*64 
Phe He Thr Lys Pro Gly Leu Tyr Thr Cys He Ala Thr Asn Lys His 
275 280 285 

GGA GAA AAG TTC AGT ACC GCA AAG GCT GCA GCC ACT GTC AGC ATA GCA 912 
Gly Glu Lys Phe Ser Thr Ala Lys Ala Ala Ala Thr Val Ser He Ala 
290 295 300 

GAA TGG AGT AAG TCA CAG AAA GAC AGC CAA GGC TAC TGT GCC CAG TAC 960 
Glu Trp Ser Lys Ser Gin Lys Asp Ser Gin Gly Tyr Cys Ala Gin Tyr 
305 310 315 320 
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AGA GGG GAG GTG TGT GAT GCA GTC CTG GCG AAA GAT GCT CTT GTC TTC 1008 
Arg Gly Glu Val Cys Asp Ala Val Leu Ala Lys Asp Ala Leu Val Phe 
325 330 335 

TTC AAC ACC TCC TAG CGG GAC CCC GAG GAG GCC CAG GAG CTG CTG ATC 1056 
Phe Asn Thr Ser Tyr Arg Asp Pro Glu Asp Ala Gin Glu Leu Leu lie 
340 345 350 

CAC ACT GCG TGG AAT GAG CTG AAG GCT GTG AGT CCA CTG TGC CGG CCA 1104 
His Thr Ala Trp Asn Glu Leu Lys Ala Val Ser Pro Leu Cys Arg Pro 
355 360 365 

GCT GCT GAG GCT CTG CTG TGT AAC CAC CTC TTC CAA GAG TGC AGC CCT 1152 
Ala Ala Glu Ala Leu Leu Cys Asn His Leu Phe Gin Glu Cys Ser Pro 
370 375 380 

GGA GTG GTA CCT ACT CCC ATG CCC ATT TGC AGA GAG TAC TGC CTG GCG 1200 
Gly Val Val Pro Thr Pro Met Pro lie Cys Arg Glu Tyr Cys Leu Ala 
385 390 395 400 

GTA AAG GAG CTC TTC TGT GCA AAG GAA TGG CAG GCA ATG GAA GGA AAG 124 8 

Val Lys Glu Leu Phe Cys Ala Lys Glu Trp Gin Ala Met Glu Gly Lys 
405 410 415 

GCC CAC CGG GGC CTC TAC AGA TCT GGG ATG CAT CTC CTT CCG GTA CCA 12 96 

Ala His Arg Gly Leu Tyr Arg Ser Gly Met His Leu Leu Pro Val Pro 
420 425 430 

GAG TGC AGC AAG CTT CCC AGC ATG CAC CGG GAC CCC ACA GCC TGC ACA 1344 
Glu Cys Ser Lys Leu Pro Ser Met His Arg Asp Pro Thr Ala Cys Thr 
435 440 445 

AGA CTG CCA TAT TTA GCA TTC CCG TCA ATA ACG TCC TCC AGG CCG AGC 13 92 

Arg Leu Pro Tyr Leu Ala Phe Pro Ser lie Thr Ser Ser Arg Pro Ser 
450 455 460 

GCG GAC ATT CCA AAC CTG CCT GCC TCC ACC TCT TCC TTT GCC GTC TOG 1440 
Ala Asp lie Pro Asn Leu Pro Ala Ser Thr Ser Ser Phe Ala Val Ser 
465 470 475 480 

CCT GCG TAC TCC ATG ACC GTC ATC ATC TCC ATC GTG TCC AGC TTT GCC 1488 
Pro Ala Tyr Ser Met Thr Val lie He Ser He Val Ser Ser Phe Ala 
4B5 490 495 

CTG TTT GCT CTT CTC ACC ATC GCT ACT CTC TAT TGC TGC CGA AGG AGG 1536 
Leu Phe Ala Leu Leu Thr He Ala Thr Leu Tyr Cys Cys Arg Arg Arg 
5O0 505 510 

AAA GAA TGG AAA AAT AAG AAA AGA GAG TCG ACC GCG GTG ACC CTC ACC 1584 
Lys Glu Trp Lys Asn Lys Lys Arg Glu Ser Thr Ala Val Thr Leu Thr 
515 520 525 

ACG TTG CCT TCC GAG CTC CTG CTG GAT AGG CTC CAT CCC AAC CCC ATG 16 32 

Thr Leu Pro Ser Glu Leu Leu Leu Asp Arg Leu His Pro Asn Pro Met 
530 535 540 

TAC CAG AGG ATG CCA CTC CTT CTG AAT CCT AAG TTG CTC AGC CTG GAG 16 80 

Tyr Gin Arg Met Pro Leu Leu Leu Asn Pro Lys Leu Leu Ser Leu Glu 
545 550 555 560 
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TAT CCG AGG AAT AAC ATT GAG TAT GTC CGA GAC ATC GGA GAG GGG GCG 172 8 

Tyr Pro Arg Asn Asn He Glu Tyr Val Arg Asp He Gly Glu Gly Ala 
565 570 575 

TTT GGA AGA GTC TTC CAA GCA AGG GCC CCT GGC TTG CTG CCT TAT GAA 1776 
Phe Gly Arg Val Phe Gin Ala Arg Ala Pro Gly Leu Leu Pro Tyr Glu 
580 5B5 590 

CCT TTC ACT ATG GTG GCC GTG AAG ATG CTT AAG GAA GAG GCC TCT GCA 1824 
Pro Phe Thr Met Val Ala Val Lys Met Leu Lys Glu Glu Ala Ser Ala 
595 600 605 

GAC ATG CAA GCG GAC TTT CAG AGG GAG GCG GCC CTC ATG GCA GAG TTT 1872 
Asp Met Gin Ala Asp Phe Gin Arg Glu Ala Ala Leu Met Ala Glu Phe 
6X0 615 620 

GAC AAC CCC AAC ATC GTG AAA CTC TTA GGT GTG TGT GCC GTT GGG AAG 1920 
Asp Asn Pro Asn lie Val Lys Leu Leu Gly Val Cys Ala Val Gly Lys 
625 630 635 640 

CCG ATG TGT CTG CTC TTT GAA TAT ATG GCC TAT GGT GAC CTC AAT GAG 1968 
Pro Met Cys Leu Leu Phe Glu Tyr Met Ala Tyr Gly Asp Leu Asn Glu 
645 650 655 

TTC CTC CGA AGT ATG TCC CCG CAC ACT GTT TGC AGC CTC AGC CAC AGT 2016 
Phe Leu Arg Ser Met Ser Pro His Thr Val Cys Ser Leu Ser His Ser 
660 665 670 

GAC CTG TCC ACG AGG GCT CGG GTG TCT AGC CCT GGT CCT CCA CCA CTG 2064 
Asp Leu Ser Thr Arg Ala Arg Val Ser Ser Pro Gly Pro Pro Pro Leu 
675 680 685 

TCC TGT GCA GAA CAG CTC TGC ATT GCC AGG CAG GTG GCA GCT GGC ATG 2112 
Ser Cys Ala Glu Gin Leu Cys lie Ala Al^g Gin Val Ala Ala Gly Met 
690 695 700 

GCC TAC CTT TCA GAG CGC AAG TTT GTC CAC CGG GAC TTA GCT ACC AGG 2160 
Ala Tyr Leu Ser Glu Arg Lys Phe Val His Arg Asp Leu Ala Thr Arg 
705 710 715 720 

AAC TGC CTG GTT GGG GAG ACC ATG GTG GTG AAA ATT GCA GAC TTT GGC 2208 
Asn Cys Leu Val Gly Glu Thr Met Val Val Lys He Ala Asp Phe Gly 
725 730 735 

CTC TCC AGG AAC ATC TAT TCC GCA GAC TAC TAC AAA GCT GAT GGA AAT 2256 
Leu Ser Arg Asn He Tyr Ser Ala Asp Tyr Tyr Lys Ala Asp Gly Asn 
740 745 750 

GAC GCC ATC CCT ATC CGC TGG ATG CCG CCC GAG TCT ATC TTC TAC AAC 2304 
Asp Ala He Pro He Arg Trp Met Pro Pro Glu Ser He Phe Tyr Asn 
755 760 765 

CGC TAC ACC ACG GAG TCG GAT GTA TGG GCC TAT GGT GTG GTC CTC TGG 2352 
Arg Tyr Thr Thr Glu Ser Asp Val Trp Ala Tyr Gly Val Val Leu Trp 
770 775 780 

GAG ATC TTC TCC TAT GGG CTG CAG CCC TAC TAT GGA ATG GCC CAC GAG 2400 
Glu He Phe Ser Tyr Gly Leu Gin Pro Tyr Tyr Gly Met Ala His Glu 
785 790 795 BOO 
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GAG GTC ATT TAC TAT GTG AG A GAT GGC AAC ATC CTC GCC TGC CCT GAG 24 4 8 

Glu Val lie Tyr Tyr Val Arg Asp Gly Asn lie Leu Ala Cys Pro Glu 
805 810 815 

AAC TGC GCC TTG GAA CTG TAG AAC CTC ATG CGC CTG TGT TGG AGC AAG 24 96 

Asn Cys Pro Leu Glu Leu Tyr Asn Leu Met Arg Leu Cys Trp Ser Lys 
820 825 830 

CTG CCT GCT GAT AGA CCC AGC TTC TGC AGT ATC CAC AGG ATC CTG CAG 2 544 

Leu Pro Ala Asp Arg Pro Ser Phe Cys Ser lie His Arg lie Leu Gin 
835 840 B45 

CGC ATG TGC GAG AGA GCA GAG GGA ACG GTG GGT GTC 258 0 

Arg Met Cys Glu Arg Ala Glu Gly Thr Val Gly Val 
850 855 B60 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 860 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Arg Glu Leu Val Asn lie Pro Leu Leu Gin Met Leu Thr Leu Val 
1 5 10 . 15 , 

Ala Phe Ser Gly Thr Glu Lys Leu Pro Lys Ala Pro Val lie Thr Thr 
20 25 30 

Pro Leu Glu Thr Val Asp Ala Leu Val Glu Glu Val Ala Thr Phe Met 
35 40 45 

Cys Ala Val Glu Ser Tyr Pro Gin Pro Glu lie Ser Trp Thr Arg Asn 
50 55 €0 

Lys lie Leu lie Lys Leu Phe Asp Thr Arg Tyr Ser lie Arg Glu Asn 
65 70 75 80 

Gly Gin Leu Leu Thr lie Leu Ser Val Glu Asp Ser Asp Asp Gly lie 
85 90 95 

Tyr Cys Cys lie Ala Asn Asn Gly Val Gly Gly Ala Val Glu Ser Cys 
100 105 110 

Gly Ala Leu Gin Val Lys Met Lys Pro Lys lie Thr Arg Pro Pro lie 
115 120 125 

Asn Val Lys lie lie Glu Gly Leu Lys Ala Val Leu Pro Cys Thr Thr 
130 135 140 

Met Gly Asn Pro Lys Pro Ser Val Ser Trp lie Lys Gly Asp Asn Ala 
145 150 155 160 

Leu Arg Glu Asn Ser Arg lie Ala Val Leu Glu Ser Gly Ser Leu Arg 
165 170 175 
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He His Asn Val Gin Lys Glu Asp Ala Gly Gin Tyr Arg Cys Val Ala 
180 185 190 

Lys Asn Ser Leu Gly Thr Ala Tyr Ser Lys Leu Val Lys Leu Glu Val 
195 200 205 

Glu Val Phe Ala Arg He Leu Arg Ala Pro Glu Ser His Asn Val Thr 
210 215 220 

Phe Gly Ser Phe Val Thr Leu Arg Cys Thr Ala He Gly He Pro Val 
225 230 235 240 

Pro Thr He Ser Trp He Glu Asn Gly Asn Ala Val Ser Ser Gly Ser 
245 250 255 

He Gin Glu Ser Val Lys Asp Arg Val He Asp Ser Arg Leu Gin Leu 
260 2€5 270 

Phe He Thr Lys Pro Gly Leu Tyr Thr Cys He Ala Thr Asn Lys His 
275 280 285 

Gly Glu Lys Phe Ser Thr Ala Lys Ala Ala Ala Thr Val Ser He Ala 
290 295 300 

Glu Trp Ser Lys Ser Gin Lys Asp Ser din Gly Tyr Cys Ala Gin Tyr 
305 310 315 320 

Arg Gly Glu Val Cys Asp Ala Val Leu Ala Lys Asp Ala Leu Val Phe 
325 330 335 

Phe Asn Thr Ser Tyr Arg Asp Pro Glu Asp Ala Gin Glu Leu I-eu He 
340 345 350 

His Thr Ala Trp Asn Glu Leu Lys Ala Val Ser Pro Leu Cys Arg Pro 
355 360 365 

Ala Ala Glu Ala I*eu Leu Cys Asn His Leu Phe Gin Glu Cys Ser Pro 
370 375 380 

Gly Val Val Pro Thr Pro Met Pro He Cys Arg Glu Tyr Cys Leu Ala 
385 390 395 400 

Val Lys Glu Leu Phe Cys Ala Lys Glu Trp Gin Ala Met Glu Gly Lys 
405 410 415 

Ala His Arg Gly Leu Tyr Arg Ser Gly Met His Leu Leu Pro Val Pro 
420 425 430 

Glu Cys Ser Lys Leu Pro Ser Met His Arg Asp Pro Thr Ala Cys Thr 

435 440 445 

Arg Leu Pro Tyr Leu Ala Phe Pro Ser He Thr Ser Ser Arg Pro Ser 
450 455 460 

Ala Asp He Pro Asn Leu Pro Ala Ser Thr Ser Ser Phe Ala Val Ser 
465 470 475 480 

Pro Ala Tyr Ser Met Thr Val He He Ser He Val Ser Ser Phe Ala 
485 490 495 

Leu Phe Ala Leu Leu Thr He Ala Thr Leu Tyr Cys Cys Arg Arg Arg 
500 505 510 
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Lys Giu Trp Lys Asn Lys Lys Arg Glu Ser Thr Ala Val Thr Leu Thr 
515 520 525 

Thr Leu Pro Ser Glu Leu Leu Leu Asp Arg Leu His Pro Asn Pro Met 
530 535 ' 540 

Tyr Gin Arg Met Pro Leu Leu Leu Asn Pro Lys Leu Leu Ser Leu Glu 
545 550 555 560 

Tyr Pro Arg Asn Asn lie Glu Tyr Val Arg Asp lie Gly Glu Gly Ala 
565 570 575 

Phe Gly Arg Val Phe Gin Ala Arg Ala Pro Gly Leu Leu Pro Tyr Glu 
580 585 590 

Pro Phe Thr Met Val Ala Val Lys Met Leu Lys Glu Glu Ala Ser Ala 
595 600 605 

Asp Met Gin Ala Asp Phe Gin Arg Glu Ala Ala Leu Met Ala Glu Phe 
610 615 620 

Asp Asn Pro Asn lie Val. Lys Leu Leu Gly Val Cys Ala Val Gly Lys 
625 630 635 640 

Pro Met Cys Leu Leu Phe Glu Tyr Met Ala Tyr Gly Asp Leu Asn Glu 
645 650 655 

Phe Leu Arg Ser Met Ser Pro His Thr Val Cys Ser Leu Ser His Ser 
660 665 670 

Asp Leu Ser Thr Arg Ala Arg Val Ser Ser Pro Gly Pro Pro Pro Leu 
675 680 685 

Ser Cys Ala Glu Gin Leu Cys lie Ala Arg Gin Val Ala Ala Gly Met 
690 695 700 

Ala Tyr Leu Ser Glu Arg Lys Phe Val His Arg Asp Leu Ala Thr Aro 
705 710 715 720 

Asn Cys Leu Val Gly Glu Thr Met Val Val Lys He Ala Asp Phe Gly 
725 730 735 

Leu Ser Arg Asn He Tyr Ser Ala Asp Tyr Tyr Lys Ala Asp Gly Asn 
740 745 750 

Asp Ala He Pro He Arg Trp Met Pro Pro Glu Ser He Phe Tyr Asn 
755 760 765 

Arg Tyr Thr Thr Glu Ser Asp Val Trp Ala Tyr Gly Val Val Leu Trp 
770 775 780 

Glu He Phe Ser Tyr Gly Leu Gin Pro Tyr Tyr Gly Met Ala His Glu 
7B5 790 795 800 

Glu Val He Tyr Tyr Val Arg Asp Gly Asn He Leu Ala Cys Pro Glu 
805 810 815 

Asn Cys Pro Leu Glu Leu Tyr Asn Leu Met Arg Leu Cys Trp Ser Lys 
820 825 830 

Leu Pro Ala Asp Arg Pro Ser Phe Cys Ser He His Arg He Leu Gin 
835 B40 845 
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Arg Met Cys Glu Arg Ala Glu Gly Thr Val Gly Val 
850 855 860 

(2) INFORMATION PGR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2604 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATG AGA GAG CTT GTC AAC ATT CCA CTG TTA CAG ATG CTC ACC CTG GTT 48 
Met Arg Glu Leu Val Asn lie Pro Leu Leu Gin Met Leu Thr Leu Val 
15 10 15 

GCC TTC AGC GGG ACT GAG AAA CTT CCA AAA GCC CCT GTC ATC ACC ACG 96 
Ala Phe Ser Gly Thr Glu Lys Leu Pro Lys Ala Pro Val He Thr Thr 
20 25 30 

CCT CTT GAA ACT GTA GAT GCC TTG GTT GAA GAA GTA GCG ACT TTC ATG 144 
Pro Lieu Glu Thr Val Asp Ala Leu Val Glu Glu Val Ala Thr Phe Met 
35 40 45 

TGT GCC GTG GAA TCC TAG CCT CAG CCC GAG ATT TCT TGG ACC AGA-AAT 192 
Cys Ala Val Glu Ser Tyr Pro Qln Pro Qlu lie Ser Trp Thr Arg Asn 
50 55 60 

AAA ATT CTC ATT AAG CTG TTT GAC ACC CGC TAC AGC ATC CGG GAG AAT 24 0 

Lys He Leu He Lys Leu Phe Asp Thr Arg Tyr Ser He Arg Glu Asn 
€5 70 75 80 



GGT CAG CTC CTC ACC ATT CTG AGC GTG GAA GAC AGT GAT GAT GGC ATC 2 88 

Gly Gin Leu Leu Thr He Leu Ser Val Glu Asp Ser Asp Asp Gly He 
B5 90 95 

TAC TGC TGC ATA GCC AAC AAT GGA GTG GGA GGA GCC GTG GAG AGT TGT 336 
Tyr Cys Cys He Ala Asn Asn Gly Val Gly Gly Ala Val Glu Ser Cys 
100 105 110 

GGT GCC CTG CAA GTG AAG ATG AAA CCT AAA ATA ACT CGT CCT CCC ATT 3 84 

Gly Ala Leu Gin Val Lys Met Lys Pro Lys He Thr Arg Pro Pro He 
115 120 125 

AAT GTA AAA ATA ATA GAG GGA TTG AAG GCA GTT CTG CCG TGC ACT ACG ,32 
Asn Val Lys He He Glu Gly Leu Lys Ala Val Leu Pro Cys Thr Thr 
130 135 140 

ATG GGT AAC CCC AAA CCA TCT GTG TCC TGG ATC AAG GGG GAC AAT GCT 4 SO 

Met Gly Asn Pro Lys Pro Ser Val Ser Trp He Lys Gly Asp P^sn Ala 
145 150 155 16C 
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CTC AGG GAA AAT TCC AGA ATC GCA GTT CTT GAA TCT GGG AGC TTA AGG 52 i 

Leu Arg Glu Asn Ser Arg lie Ala Val Leu Glu Ser Glv Ser Leu Arc 
165 170 ' 175 

ATC CAT AAT GTG CAA AAG GAA GAT GCA GGA CAG TAC CGC TGT GTG GCC 5 7, 

He His Asn Val Gin Lys Glu Asp Ala Gly Gin Tyr Arg Cys Val Ala 
180 185 190 

AAA AAC AGC CTG GGC ACA GCT TAC TCC AAA CTG GTG AAG CTG GAA GTG 6 2** 

Lys Asn Ser Leu Gly Thr Ala Tyr Ser Lys Leu Val Lvs Leu Glu Val 
195 200 205 

GAG GTT TTT GCA AGA ATC CTG CGT GCT CCT GAA TCC CAC AAT GTC ACC 6 72 

Glu Val Phe Ala Arg He Leu Arg Ala Pro Glu Ser His Asn Val Thr 
210 215 220 

TTT GGT TCC TTT GTA ACC CTA CGC TGC ACA GCA ATA GGC ATC CCT GTC 720 
Phe Gly Ser Phe Val Thr Leu Arg Cys Thr Ala He Gly He Pro Val 
225 230 235 240 

CCC ACC ATC AGC TGG ATT GAA AAC GGA AAT GCT GTT TCT TCA GGT TCC 768 
Pro Thr He Ser Trp He Glu Asn Gly Asn Ala Val Ser Ser Gly Ser 
245 250 255 

ATT CAA GAG AGT GTG AAA GAC CGA GTG ATT GAC TCA AGA CTC CAG CTC 816 
He Gin Glu Ser Val Lys Asp Arg Val He Asp Ser Arg Leu Gin Leu 
260 265 270 

TTC ATC ACA AAG CCA GGA CTC TAC ACA TGC ATA GCT ACC AAT AAG CAC 864 
Phe He Thr Lys Pro Gly Leu Tyr Thr Cys He Ala Thr Asn Lys His 
275 280 285 

GGA GAA AAG TTC AGT ACC GCA AAG GCT GCA GCC ACT GTC AGC ATA GCA 912 
Gly Glu Lys Phe Ser Thr Ala Lys Ala Ala Ala Thr Val Ser He Ala 
290 295 300 

GAA TGG AGT AAG TCA CAG AAA GAC AGC CAA GGC TAC TGT GCC CAG TAC 960 
Glu Trp Ser Lys Ser Gin Lys Asp Ser Gin Gly Tyr Cys Ala Gin Tyr 
305 310 315 320 

AGA GGG GAG GTG TGT GAT GCA GTC CTG GCG AAA GAT GCT CTT GTC TTC 1008 
Arg Gly Glu Val Cys Asp Ala Val Leu Ala Lys Asp Ala Leu Val Phe 
325 330 335 

TTC AAC ACC TCC TAC CGG GAC CCC GAG GAC GCC CAG GAG CTG CTG ATC 105-* 
Phe Asn Thr Ser Tyr Arg Asp Pro Glu Asp Ala Gin Glu Leu Leu He 
340 345 350 

CAC ACT GCG TGG AAT GAG CTG AAG GCT GTG AGT CCA CTG TGC CGG CCA 1104 
His Thr Ala Trp Asn Glu Leu Lys Ala Val Ser Pro Leu Cys Arq Pro 
355 360 365 

GCT GCT GAG GCT CTG CTG TGT AAC CAC CTC TTC CAA GAG TGC AGC CCT 1152 
Ala Ala Glu Ala Leu Leu Cys Asn His Leu Phe Gin Glu Cys Ser Pro 
370 375 380 

GGA GTG GTA CCT ACT CCC ATG CCC ATT TGC AGA GAG TAC TGC CTG GCG 12 OC 

Gly Val Val Pro Thr Pro Met Pro He Cys Arg Glu Tyr Cys Leu Ala 
385 390 395 40C 
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GTA AAG GAG CTC TTC TGT GCA AAG GAA TGG CAG GCA ATG GAA GGA AAG 12 4 8 

Val Lys Glu Leu Phe Cys Ala Lys Glu Trp Gin Ala Met Glu Gly Uys 
405 410 415 

GCC CAC CGG GGC CTC TAC AGA TCT GGG ATG CAT CTC CTT CCG GTA CCA 12 96 

Ala His Arg Gly Leu Tyr Arg Ser Gly Met His Leu Leu Pro Val Pro 
420 425 430 

GAG TGC AGC AAG CTT CCC AGC ATG CAC CGG GAC CCC ACA GCC TGC ACA 1344 
Glu Cys Ser Lys Leu Pro Ser Met His Arg Asp Pro Thr Ala Cys Thr 
435 440 445 

AGA CTG CCA TAT TTA GAT TAT AAA AAA GAA AAC ATA ACA ACA TTC CCG 1392 
Arg Leu Pro Tyr Leu Asp Tyr Lys Lys Glu Asn lie Thr Thr Phe Pro 
450 455 460 

TCA ATA ACG TCC TCC AGG CCG AGC GCG GAC ATT CCA AAC CTG CCT GCC 1440 
Ser lie Thr Ser Ser Arg Pro Ser Ala Asp lie Pro Asn Leu Pro Ala 
465 470 475 480 

TCC ACC TCT TCC TTT GCC GTC TCG CCT GCG TAC TCC ATG ACC GTC ATC 1488 
Ser Thr Ser Ser Phe Ala Val Ser Pro Ala Tyr Ser Met Thr Val lie 
485 490 495 

ATC TCC ATC GTG TCC AGC TTT GCC CTG TTT GCT CTT CTC ACC ATC GCT 1536 
lie Ser lie Val Ser Ser Phe Ala Leu Phe Ala Leu Leu Thr lie Ala 
500 505 510 

ACT CTC TAT TGC TGC CGA AGG AGG AAA GAA TGG AAA AAT AAG AAA AGA 1584 
Thr Leu Tyr Cys Cys Arg Arg Arg Lys Glu Trp Lys Asn Lys Lys Arg 
515 520 525 

GAG TCG ACC GCG GTG ACC CTC ACC ACG TTG CCT TCC GAG CTC CTG CTG 1632 
Glu Ser Thr Ala Val Thr Leu Thr Thr Leu Pro Ser Glu Leu X<eu Leu 
530 535 540 

GAT AGG CTC CAT CCC AAC CCC ATG TAC CAG AGG ATG CCA CTC CTT CTG 1680 
Asp Arg Leu His Pro Asn Pro Met Tyr Gin Arg Met Pro Leu Leu Leu 
545 550 555 560 

AAT CCT AAG TTG CTC AGC CTG GAG TAT CCG AGG AAT AAC ATT GAG TAT 1728 
Asn Pro Lys Leu Leu Ser Leu Glu Tyr Pro Arg Asn Asn He Glu Tyr 
565 570 575 

GTC CGA GAC ATC GGA GAG GGG GCG TTT GGA AGA GTC TTC CAA GCA AGG 1776 
Val Arg Asp He Gly Glu Gly Ala Phe Gly Arg Val Phe Gin Ala Arg 
580 585 590 

GCC CCT GGC TTG CTG CCT TAT GAA CCT TTC ACT ATG GTG GCC GTG AAG 1824 
Ala Pro Gly Leu Leu Pro Tyr Glu Pro Phe Thr Met Val Ala Val Lys 
595 600 605 

ATG CTT AAG GAA GAG GCC TCT GCA GAC ATG CAA GCG GAC TTT CAG AGG 1872 
Met Leu Lys Glu Glu Ala Ser Ala Asp Met Gin Ala Asp Phe Gin Arg 
610 615 620 

GAG GCG GCC CTC ATG GCA GAG TTT GAC AAC CCC AAC ATC GTG AAA CTC 1920 
Glu Ala Ala Leu Met Ala Glu Phe Asp Asn Pro Asn He Val Lys Leu 
625 630 635 640 
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TTA GGT GTG TGT GCC GTT GGG AAG CCG ATG TGT CTG CTC TTT GAA TAT 196 6 

Leu Gly Val Cys Ala Val Gly hys Pro Met Cys Leu Leu Phe Glu Tyr 
€45 650 €55 

ATG GCC TAT GGT GAC CTC AAT GAG TTC CTC CGA AGT ATG TCC CCG CAC 2016 
Met Ala Tyr Gly Asp Leu Asn Glu Phe Leu Arg Ser Met Ser Pro His 
660 665 670 

ACT GTT TGC AGC CTC AGC CAC AGT GAC CTG TCC ACG AGG GCT CGG GTG 2064 
Thr Val Cys Ser Leu Ser His Ser Asp Leu Ser Thr Arg Ala Arg Val 
675 680 685 

TCT AGC CCT GGT CCT CCA CCA CTG TCC TGT GCA GAA CAG CTC TGC ATT 2112 
Ser Ser Pro Gly Pro Pro Pro Leu Ser Cys Ala Glu Gin Leu Cys He 
690 695 700 

GCC AGG CAG GTG GCA GCT GGC ATG GCC TAC CTT TCA GAG CGC AAG TTT 216C 
Ala Arg Gin Val Ala Ala Gly Met Ala Tyr Leu Ser Glu Arg Lys Phe 
705 710 715 720 

GTC CAC CGG GAC TTA GCT ACC AGG AAC TGC CTG GTT GGG GAG ACC ATG 2208 
Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Thr Met 
725 730 735 

GTG GTG AAA ATT GCA GAC TTT GGC CTC TCC AGG AAC ATC TAT TCC GCA 2256 
Val Val Lys He Ala Asp Phe Gly Leu Ser Arg Asn lie Tyr Ser Ala 
740 745 750 

GAC TAC TAC AAA GCT GAT GGA AAT GAC GCC ATC CCT ATC CGC TGG ATG 2304 
Asp Tyr Tyr Lys Ala Asp Gly Asn Asp Ala He Pro He Arg Trp Met 
755 760 765 

CCG CCC GAG TCT ATC TTC TAC AAC CGC TAC ACC ACG GAG TCG GAT GTA 2352 
Pro Pro Glu Ser He Phe Tyr Asn Arg Tyr Thr Thr Glu Ser Asp Val 
770 775 780 

TGG GCC TAT GGT GTG GTC CTC TGG GAG ATC TTC TCC TAT GGG CTG CAG 2400 
Trp Ala Tyr Gly Val Val Leu Trp Glu He Phe Ser Tyr Gly Leu Gin 
785 790 795 800 

CCC TAC TAT GGA ATG GCC CAC GAG GAG GTC ATT TAC TAT GTG AGA GAT 2448 
Pro Tyr Tyr Gly Met Ala His Glu Glu Val He Tyr Tyr Val Arg Asp 
805 810 815 

GGC AAC ATC CTC GCC TGC CCT GAG AAC TGC CCC TTG GAA CTG TAC AAC 24 96 

Gly Asn He Leu Ala Cys Pro Glu Asn Cys Pro Leu Glu Leu T>^r Asn 
820 825 830 

CTC ATG CGC CTG TGT TGG AGC AAG CTG CCT GCT GAT AGA CCC AGC TTC 2544 
Leu Met Arg Leu Cys Trp Ser Lys Leu Pro Ala Asp Arg Pro Ser Phe 
835 840 845 

TGC AGT ATC CAC AGG ATC CTG CAG CGC ATG TGC GAG AGA GCA GAG GGA 25 92 

Cys Ser He His Arg He Leu Gin Arg Met Cys Glu Arg Ala Glu Gly 
650 855 860 

ACG GTG GGT GTC 2604 

Thr Val Gly Val 

865 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

Met Arg Glu Leu Val Asn He Pro Leu Leu Gin Met Leu Thr Leu Val 
15 10 15 

Ala Phe Ser Gly Thr Glu Lys Leu Pro Lys Ala Pro Val He Thr Thr 
20 25 30 

Pro Leu Glu Thr Val Asp Ala Leu val Glu Glu Val Ala Thr Phe Met 
35 40 45 

Cys Ala Val Glu Ser Tyr Pro Gin Pro Glu He Ser Trp Thr Arg Asn 
50 55 60 

Lys He Leu He Lys Leu Phe Asp Thr Arg Tyr Ser He Arg Glu Asn 
65 70 75 80 

Gly Gin Leu Leu Thr He Leu Ser Val Glu Asp Ser Asp Asp Gly He 
85 90 95 

Tyr Cys Cys He Ala Asn Asn Gly Val Gly Gly Ala Val Glu Ser Cys 
lOa 105 110 

Gly Ala Leu Gin Val Lys Met Lys Pro Lys He Thr Arg Pro Pro He 
115 120 125 

Asn Val Lys He He Glu Gly Leu Lys Ala Val Leu Pro Cys Thr- Thr 
130 135 140 

Met Gly Asn Pro Lys Pro Ser Val Ser Trp He Lys Gly Asp Asn Ala 
145 150 155 160 

Leu Arg Glu Asn Ser Arg He Ala Val Leu Glu Ser Gly Ser Leu Arg 
165 170 175 

He His Asn Val Gin Lys Glu Asp Ala Gly Gin Tyr Arg Cys Val Ala 
180 185 190 

Lys Asn Ser Leu Gly Thr Ala Tyr Ser Lys Leu Val Lys Leu Glu Val 
195 200 205 

Glu Val Phe Ala Arg He Leu Arg Ala Pro Glu Ser His Asn Val Thr 
210 215 220 

Phe Gly Ser Phe Val Thr Leu Arg Cys Thr Ala He Gly He Pro Val 
225 230 235 240 

Pro Thr He Ser Trp He Glu Asn Gly Asn Ala Val Ser Ser Gly Ser 
245 250 255 

He Gin Glu Ser Val Lys Asp Arg Val He Asp Ser Arg Leu Gin Leu 
260 265 270 
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Phe lie Thr Lys Pro Gly Leu Tyr Thr Cys lie Ala Thr Asn Lys His 
275 2B0 285 

Gly Glu Lys Phe Ser Thr Ala Lys Ala Ala Ala Thr Val Ser He Ala 
290 295 300 

Glu Trp Ser Lys Ser Gin Lys Asp Ser Gin Gly Tyr Cys Ala Gin Tyr 
305 310 315 320 

Arg Gly Glu Val Cys Asp Ala Val Leu Ala Lys Asp Ala Leu Val Phe 
325 330 335 

Phe Asn Thr Ser Tyr Arg Asp Pro Glu Asp Ala Gin Glu Leu Leu He 
340 345 350 

His Thr Ala Trp Asn Glu Leu Lys Ala Val Ser Pro Leu Cys Arg Pro 
355 360 365 

Ala Ala Glu Ala Leu Leu Cys Asn His Leu Phe Gin Glu Cys Ser Pro 
370 375 380 

Gly Val Val Pro Thr Pro Met Pro He Cys Arg Glu Tyr Cys Leu Ala 
3B5 390 395 400 

Val Lys Glu Leu Phe Cys Ala Lys Glu Trp Gin Ala Met Glu Gly Lys 
405 410 415 

Ala His Arg Gly Leu Tyr Arg Ser Gly Met His Leu Leu Pro Val Pro 
420 425 430 

Glu Cys Ser Lys Leu Pro Ser Met His Arg Asp Pro Thr Ala Cys Thr 
435 440 445 

Arg Leu Pro Tyr Leu Asp Tyr Lys Lys Glu Asn He Thr Thr Phe Pro 
450 455 460 

Ser He Thr Ser Ser Arg Pro Ser Ala Asp He Pro Asn Leu Pro Ala 
465 470 475 480 

Ser Thr Ser Ser Phe Ala Val Ser Pro Ala Tyr Ser Met Thr Val He 
485 490 495 

He Ser He Val Ser Ser Phe Ala Leu Phe Ala Leu Leu Thr He Ala 
500 505 510 

Thr Leu Tyr Cys Cys Arg Arg Arg Lys Glu Trp Lys Asn Lys Lvs Arg 
515 520 525 

Glu Ser Thr Ala Val Thr Leu Thr Thr Leu Pro Ser Glu Leu Leu Leu 
530 535 540 

Asp Arg Leu His Pro Asn Pro Met Tyr Gin Arg Met Pro Leu Leu Leu 
545 550 555 560 

Asn Pro Lys Leu Leu Ser Leu Glu Tyr Pro Arg Asn Asn He Glu Tyr 
565 570 575 

Val Arg Asp He Gly Glu Gly Ala Phe Gly Arg Val Phe Gin Ala Arg 
580 585 590 

Ala Pro Gly Leu Leu Pro Tyr Glu Pro Phe Thr Met Val Ala Val Lys 
595 600 605 
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Met Leu Lys Glu Glu Ala Ser Ala Asp Met Gin Ala Asp Phe Gin Arg 
610 61S 620 

Glu Ala Ala Leu Met Ala Glu Phe Asp Asn Pro Asn lie Val Lys Leu 
€25 630 635 640 

Leu Gly Val Cys Ala Val Gly Lys Pro Met Cys Leu Leu Phe Glu Tyr 
645 650 655 

Met Ala Tyr Gly Asp Leu Asn Glu Phe Leu Arg Ser Met Ser Pro His 
660 665 670 

Thr Val Cys Ser Leu Ser His Ser Asp Leu Ser Thr Arg Ala Arg Val 
675 680 685 

Ser Ser Pro Gly Pro Pro Pro Leu Ser Cys Ala Glu Gin Leu Cys lie 
690 695 700 

Ala Arg Gin Val Ala Ala Gly Met Ala Tyr Leu Ser Glu Arg Lys Phe 
705 710 715 720 

Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Thr Met 
725 730 735 

Val Val Lys lie Ala Asp Phe Gly Leu Ser Arg Asn lie Tyr Ser Ala 
740 745 750 

Asp Tyr Tyr Lys Ala Asp Gly Asn Asp Ala lie Pro lie Arg Trp Met 
755 760 765 

Pro Pro Glu Ser lie Phe Tyr Asn Arg Tyr Thr Thr Glu Ser Asp Val 
770 775 780 

Trp Ala Tyr Gly Val Val XjOU Trp Glu lie Phe Ser Tyr Gly Leu Gin 
785 790 795 800 

Pro Tyr Tyr Gly Met Ala His Glu Glu Val He Tyr Tyr Val Arg Asp 
605 810. 815 

Gly Asn He Leu Ala Cys Pro Glu Asn Cys Pro Leu Glu Leu Tyr Asn 
820 825 830 

Leu Met Arg Leu Cys Trp Ser Lys Leu Pro Ala Asp Arg Pro Ser Phe 
835 840 845 

Cys Ser He His Arg lie Leu Gin Arg Met Cys Glu Arg Ala Glu Gly 
850 855 860 

Thr Val Gly Val 
865 
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What is claimed is: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of: 

(a) the nucleotide sequence of SEQ ID N0:1 from nucleotide 13 to nucleotide 
1602; • 

(b) the nucleotide sequence of SEQ ID NO: 18 from nucleotide I to nucleotide 

2580; 

(c) the nucleotide sequence of SEQ ID NO: 20 from nucleotide 1 to nucleotide 

2604; 

(d) a nucleotide sequence capable of hybridizing to a nucleic acid sequence 
specified in (a)-(c); 

(e) a nucleotide sequence varying from the sequence of a nucleotide sequence 
specified in (a)-(c) as a result of degeneracy of the genetic code; and 

(f) an allelic variant of a nucleotide sequence specified in (a)-(c). 

2. The polynucleotide of claim 1 wherein said nucleotide sequence encodes for a protein 
having an activity selected from the group consisting of mlk receptor activity and mlk ligand 
binding activity. 

3. The polynucleotide of claim 1 wherein said nucleotide sequence is operably linked to 
an expression control sequence. 
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4. The polynucleotide of claim 1 which encodes a murine mlk protein. 

5. The polynucleotide of claim 1 which encodes a human mlk protein. 

6. The polynucleotide of claim 1 comprising the nucleotide sequence of SEQ ID NO: 1 
from nucleotide 13 to nucleotide 1602. 

7. The polynucleotide of claim 1 encoding the protein of claim 14. 

8. The polynucleotide of claim I comprising the nucleotide sequence of SEQ ID NO: 18 
from nucleotide 1 to nucleotide 2580. 

9. The polynulceotide of claim I comprising the nucleotide sequence of SEQ ID NO: 20 
from nucleotide 1 to nucleotide 2604. 

10. The polynucleotide of claim 2 which encodes a protein having mlk ligand binding 
activity. 

11. A host cell transformed with the polynucleotide of claim 3. 

12. The host cell of claim 11, wherein said cell is a manrnialian cell. 
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13. A process for producing a mlk protein, said process comprising: 

(a) growing a culture of the host cell of claim 11 in a suitable culture medium; 

and 

(b) purifying the mlk protein from the culture. 

14. An isolated mlk receptor protein comprising an amino acid sequence selected from the 
group consisting of: 



(a) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


N0:2; 


(b) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


NO; 2 from amino acids 1 to 156; 


(c) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


N0:2 from amino acids 157 to 177: 


(d) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


N0:2 from amino acids 178 to 530; 


(e) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


NO: 2 from amino acids 242 to 517; 


(f) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


N0:15; 


(6) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


NO: 15 from amino acids 22 to 478; 


(h) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


NO: 19; 


(i) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


NO: 19 from amino acids 22 to 486; 


0) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


NO: 19 from amino acids 487 to 507: 


(k) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


NO; 19 from amino acids 508 lo 860; 


(1) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


NO: 19 from amino acids 572 to 847: 


(m) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


N0;21; 


(n) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


N0:21 from amino acids 22 to 494; 


(0) 


the 


amino 


acid 


sequence 


of SEQ 


ID 


N0:21 from amino acids 495 to 515: 
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(p) the amino acid sequence of SEQ ID N0:21 from amino acids 516 to 868; 

(q) the amino acid sequence of SEQ ID N0:21 from amino acids 580 to 855; 

(r) fragments of (a)-(q) having mlk receptor activity; and 

(s) fragments of (a)-(q) having mlk ligand binding activity. 

15. The protein of claim 14 comprising the amino acid sequence of SEQ ID NO:2. 

16. The protein of claim 14 comprising the amino acid sequence of SEQ ID NO: 15. 

17. The protein of claim 14 comprising the amino acid sequence of SEQ ID NO: 19. 

18. The protein of claim 14 comprising the amino acid sequence of SEQ ID NO:21. 

19. The protein of claim 14 comprising a fragment having mlk ligand binding activity. 

20. A pharmaceutical composition comprising a mlk protein of claim 14 and a 
pharmaceutically acceptable carrier. 

21. A mlk protein produced according to the process of claim 13. 

22. A composition comprising an antibody which specifically reacts with a mlk protein of 
claim 14. 
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23. A method of identifying a mlk receptor ligand, said method comprising: 

(a) providing a sample containing a potential source of mlk ligand; 

(b) contacting said sample with a protein of claim 14; and 

(c) collecting materials binding to said protein. 

24. A mlk receptor ligand. 

25. The ligand of claim 24 identified according to the method of claim 23. 

26. A pharmaceutical composition comprising a ligand of claim 24 and a pharmaceutically 
acceptable carrier. 

27. An isolated polynucleotide encoding a ligand of claim 24. 

28. A host cell transformed with an expression vector comprising the polynucleotide of 
claim 27. 

29. A method of treating a mik-related condition, said method comprising administering a 
therapeutically effective amount of a composition of claim 20 to a mammalian subject. 

30. A method of inhibiting binding of a ligand to a mlk protein, said method comprising 
administering a therapeutically effective amount of a composition of claim 20. 
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31. A method of treating a mlk-related condition, said method comprising administering a 
therapeutically effective amount of a composition of claim 26 to a manunalian subject. 



32. A method of inhibiting binding of a ligand to a mlk protein, said method comprising 
administering a therapeutically effective amount of a composition of claim 26. 

33. A method of identifying an inhibitor of ligand binding to mlk protein which 
comprises: 

(a) combining a mlk protein of claim 14 with a ligand, said combination forming a 
first binding mixture; 

(b) measuring the amount of binding between the mlk protein and the mlk in the 
first binding mixture; 

(c) combining a compound with the mlk protein and the ligand to form a second 
binding mixture; 

(d) measuring the amount of binding in the second binding mixture; and 

(e) comparing the amount of binding in the first binding mixture with the amount 
of binding in the second binding mixture; 

wherein the compound is capable of inhibiting ligand binding to mlk protein when a decrease 
in the amount of binding of the second binding mixture occurs. 

34. An inhibitor identified by the method of claim 33. 
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35. A pharmaceutical composition comprising the inhibitor of claim 34 and a 
pharmaceutically acceptable carrier. 

36. A method of treating a mlk-related condition, said method comprising administering a 
therapeutically effective amount of a composition of claim 35 to a mammalian subject. 

37. A method of inhibiting binding of a ligand to a mik protein, said method comprising 
administering a therapeutically effective amount of a composition of claim 35. 

38. A composition comprising an antibody which specifically reacts with a ligand of claim 
24. 

39. A method of promoting bone or cartilage growth, said method comprising 
administering a therapeutically effective amount of a composition of claim 20 or 26. 

40. A method of inhibiting bone or cartilage growth, said method comprising 
administering a therapeutically effective amount of a composition of claim 20 or 26 

41. A method of inhibiting bone loss, said method comprising administering a 
therapeutically effective amount of a composition of claim 20 or 26. 
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